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MODIFIED INTERFERON ALPHA WITH REDUCED IMMUNOGENICITY 


FIELD OF THE INVENTION 

The present invention relates to polypeptides to be administered especially to humans and 
in particular for therapeutic use. The polypeptides are modified polypeptides whereby the 
modification results in a reduced propensity for the polypeptide to elicit an immune 
response upon administration to the human subject. The invention in particular relates to 
the modification of human interferon and specifically human interferon cc2 (INFa2) to 
result in INFa2 protein variants that are substantially non-immunogenic or less 
immunogenic than any non-modified counterpart when used in vivo. The invention 
relates furthermore to T-cell epitope peptides derived from said non-modified protein by 
means of which it is possible to create modified INFa2 variants with reduced 
immunogenicity. 


BACKGROUND OF THE INVENTION 

There are many instances whereby the efficacy of a therapeutic protein is limited by an 
unwanted immune reaction to the therapeutic protein. Several mouse monoclonal 
antibodies have shown promise as therapies in a number of human disease settings but in 
certain cases have failed due to the induction of significant degrees of a human anti- 
murine antibody (HAMA) response [Schroff, R. W. et al (1985) Cancer Res, 45: 879-885; 
Shawler, D.L. et al (1985) J. Immunol 135: 1530-1535]. For monoclonal antibodies, a 
number of techniques have been developed in attempt to reduce the HAMA response 
[WO 89/09622; EP 0239400; EP 0438310; WO 91/06667]. These recombinant DNA 
approaches have generally reduced the mouse genetic information in the final antibody 
construct whilst increasing the human genetic information in the final construct. 
Notwithstanding, the resultant "humanized'' antibodies have, in several cases, still elicited 
an immune response in patients [Issacs J.D. (1990) Sem. Immunol. 2: 449, 456; Rebello, 
P.R. et al (1999) Transplantation 68: 1417-1420]. 

Antibodies are not the only class of polypeptide molecule administered as a therapeutic 
agent against which an immune response may be mounted. Even proteins of human 
origin and with the same amino acid sequences as occur within humans can still induce an 
immune response in humans. Notable examples include the therapeutic use of 

CONFIRMATION COPY 


WO 02/085941 


- 2 - 


PCT/EP02/02218 


granulocyte-macrophage colony stimulating factor [Wadhwa, M. et al (1999) Clin, 
Cancer Res. 5: 1353-1361] and DSTFa2 [Russo, D. et al (1996) Bri. 7. Haem. 94: 300-305; 
Stein, R. et al (1988) New Engl /. Med. 318: 1409-1413] amongst others. 

5 A principal factor in the induction of an immune response is the presence within the 
protein of peptides that can stimulate the activity of T-cells via presentation on MHC 
class II molecules, so-called "T-cell epitopes". Such potential T-cell epitopes are 
commonly defined as any amino acid residue sequence with the ability to bind to MHC 
Class II molecules. Such T-cell epitopes can be measured to establish MHC binding. 

10 Implicitly, a "T-cell epitope" means an epitope which when bound to MHC molecules can 
be recognized by a T-cell receptor (TCR), and which can, at least in principle, cause the 
activation of these T-cells by engaging a TCR to promote a T-cell response. It is, 
however, usually understood that certain peptides which are found to bind to MHC Class 
II molecules may be retained in a protein sequence because such peptides are recognized 

15 as "self within the organism into which the final protein is administered. 

It is known, that certain of these T-cell epitope peptides can be released during the 
degradation of peptides, polypeptides or proteins within cells and subsequently be 
presented by molecules of the major histocompatability complex (MHC) in order to 
20 trigger the activation of T-cells. For peptides presented by MHC Class II, such activation 
of T-cells can then give rise, for example, to an antibody response by direct stimulation of 
B-cells to produce such antibodies. 

MHC Class II molecules are a group of highly polymorphic proteins which play a central 
25 role in helper T-cell selection and activation. The human leukocyte antigen group DR 
(HLA-DR) are the predominant isotype of this group of proteins and are the major focus 
of the present invention. However, isotypes HLA-DQ and HLA-DP perform similar 
functions, hence the present invention is equally applicable to these. The MHC class II 
DR molecule is made of an alpha and a beta chain which insert at their C-termini through 
30 the cell membrane. Each hetero-dimer possesses a ligand binding domain which binds to 
peptides varying between 9 and 20 amino acids in length, although the binding groove 
can accommodate a maximum of 11 amino acids. The ligand binding domain is 
comprised of amino acids 1 to 85 of the alpha chain, and amino acids 1 to 94 of the beta 
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chain. DQ molecules have recently been shown to have an homologous structure and the 
DP family proteins are also expected to be very similar. In humans approximately 70 
different allotypes of the DR isotype are known, for DQ there are 30 different allotypes 
and for DP 47 different allotypes are known. Each individual bears two to four DR 

5 alleles, two DQ and two DP alleles. The structure of a number of DR molecules has been 
solved and such structures point to an open-ended peptide binding groove with a number 
of hydrophobic pockets which engage hydrophobic residues (pocket residues) of the 
peptide [Brown et al Nature (1993) 364: 33; Stern et al (1994) Nature 368: 215]. 
Polymorphism identifying the different allotypes of class II molecule contributes to a 

10 wide diversity of different binding surfaces for peptides within the peptide binding grove 
and at the population level ensures maximal flexibility with regard to the ability to 
recognize foreign proteins and mount an immune response to pathogenic organisms. 
There is a considerable amount of polymorphism within the ligand binding domain with 
distinct "families" within different geographical populations and ethnic groups. This 

15 polymorphism affects the binding characteristics of the peptide binding domain, thus 
different "families" of DR molecules will have specificities for peptides with different 
sequence properties, although there may be some overlap. This specificity determines 
recognition of Th-cell epitopes (Class II T-cell response) which are ultimately responsible 
for driving the antibody response to B-cell epitopes present on the same protein from 

20 which the Th-cell epitope is derived. Thus, the immune response to a protein in an 

individual is heavily influenced by T-cell epitope recognition which is a function of the 
peptide binding specificity of that individual's HLA-DR allotype. Therefore, in order to 
identify T-cell epitopes within a protein or peptide in the context of a global population, it 
is desirable to consider the binding properties of as diverse a set of HLA-DR allotypes as 

25 possible, thus covering as high a percentage of the world population as possible. 

An immune response to a therapeutic protein such as the protein which is object of this 
invention, proceeds via the MHC class II peptide presentation pathway. Here exogenous 
proteins are engulfed and processed for presentation in association with MHC class II 
30 molecules of the DR, DQ or DP type. MHC Class II molecules are expressed by 

professional antigen presenting cells (APCs), such as macrophages and dendritic cells 
amongst others. Engagement of a MHC class IT peptide complex by a cognate T-cell 
receptor on the surface of the T-cell, together with the cross-binding of certain other co- 
receptors such as the CD4 molecule, can induce an activated state within the T-cell. 
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Activation leads to the release of cytokines further activating other lymphocytes such as B 
cells to produce antibodies or activating T killer cells as a full cellular immune response. 
The ability of a peptide to bind a given MHC class II molecule for presentation on the 
surface of an APC is dependent on a number of factors most notably its primary 
5 sequence. This will influence both its propensity for proteolytic cleavage and also its 
affinity for binding within the peptide binding cleft of the MHC class II molecule. The 
MHC class II / peptide complex on the APC surface presents a binding face to. a particular 
T-cell receptor (TCR) able to recognize determinants provided both by exposed residues 
of the peptide and the MHC class II molecule. 

10 

In the art there are procedures for identifying synthetic peptides able to bind MHC class II 
molecules (e.g. W098/52976 and WO00/34317). Such peptides may not function as T- 
cell epitopes in all situations, particularly, in vivo due to the processing pathways or other 
phenomena. T-cell epitope identification is the first step to epitope elimination. The 

15 identification and removal of potential T-cell epitopes from proteins has been previously 
disclosed. In the art methods have been provided to enable the detection of T-cell epitopes 
usually by computational means scanning for recognized sequence motifs in 
experimentally determined T-cell epitopes or alternatively using computational 
techniques to predict MHC class II-binding peptides and in particular DR-bindirig 

20 peptides. 

W098/52976 and WOOO/34317 teach computational threading approaches to identifying 
polypeptide sequences with the potential to bind a sub-set of human MHC class II DR 
allotypes. In these teachings, predicted T-cell epitopes are removed by the use of 
judicious amino acid substitution within the primary sequence of the therapeutic antibody 
25 or non-antibody protein of both non-human and human derivation. 

Other techniques exploiting soluble complexes of recombinant MHC molecules in 
combination with synthetic peptides and able to bind to T-cell clones from peripheral 
blood samples from human or experimental animal subjects have been used in the art 
30 [Kern, F. et al (1998) Nature Medicine 4:975-978; Kwok, W.W. et al (2001) TRENDS in 
Immunology 22: 583-588] and may also be exploited in an epitope identification strategy. 
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As depicted above and as consequence thereof, it would be desirable to identify and to 
remove or at least to reduce T-cell epitopes from a given in principal therapeutically 
valuable but originally immunogenic peptide, polypeptide or protein. 

5 One of these therapeutically valuable molecules is INFa2. The molecule is an important 
glycoprotein cytokine expressed by activated macrophages. The protein has antiviral 
activity and stimulates the production of at least two enzymes; a protein kinase and an 
oligoadenylate synthetase, on binding to the interferon alpha receptor in expressing cells. 
The mature ESTFa2 protein is single polypeptide of 165 amino acids produced by post- 
10 translational processing of a 188 amino acid pre-cursor protein by cleavage of a 23 amino 
acid signal sequence from the amino terminus. Several different subtypes of human 
INPa2 are known showing minor differences between primary amino acid sequences. 
Thus INFa2a and INFa2b differ in only one residue at position 23 of the mature protein 
. chain being lysine in INFa2a and arginine in INFcc2b. Whilst the disclosures of the 
15 present invention are directed towards the sequence of INFoc2b, it can be seen that for all 
practical purposes the sequence of ENFoc2a may be considered interchangeably with the 
subject INFa2b subtype of the present invention. The amino acid sequence of 
INFa2(a,b) (depicted as one-letter code) is as follows: 

20 CDLPQTHSLGSRRTLMLLAQMR (R, K) ISLFSCLKDRHDFGFPQEEFGNQFQKAETIPVL 
HEMIQQI FNLFSTKD S S AAWDETLLDKF YTEL YQQLNDLE AC VI QGVGVTETPLMKED S I 
LAVRKYFQRITLYLKEKKYSPCAWEVVRAEIMRSFSLSTNLQESLRSKE 

The protein has considerable clinical importance as a broad spectrum anti-viral, anti- 
25 proliferative and immunomodulating agent. Recombinant and other preparations of 

INFoc2 have been used therapeutically in a variety of cancer and viral indications in man 
[reviewed in Sen, G.G. and Lengyel P, (1992), 7. Biol Chem. 267: 5017-5020]. However 
despite very significant therapeutic benefit to large numbers of patients, resistance to 
therapy in certain patients has been documented and one important mechanism of 
30 resistance has been shown to be the development of neutralising antibodies detectable in 
the serum of treated patients [Quesada, J.R. et al (1985) J.Clin. Oncology 3:1522-1528; 
Stein R.G. et al (1988) ibid; Russo, D. et al (1996) ibid; Brooks M.G. et al (1989) Gut 30: 
1116-1122]. An immune response in these patients is mounted to the therapeutic 
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interferon despite the fact that a molecule of at least identical primary structure is 
produced endogenously in man. 

Others have provided modified INFa2a and methods of use [US, 4,496,537; 
5 US,5,972,331; US,5,480,640; US,5,190,751; US ,4,959,2 10], but these approaches have 
been directed towards improvements in the commercial production of INFa2a, Such 
teachings do not recognize the importance of T-cell epitopes to the immunogenic 
properties of the protein nor have been conceived to directly influence said properties in a 
specific and controlled way according to the scheme of the present invention. 

10 

However, there is a continued need for INFa2a analogues with enhanced properties. 
Desired enhancements include alternative schemes and modalities for the expression and 
purification of the said therapeutic, but also and especially, improvements in the 
biological properties of the protein. There is a particular need for enhancement of the in 
15 vivo characteristics when administered to the human subject. In this regard, it is highly 
desired to provide DSFFa2a with reduced or absent potential to induce an immune 
response in the human subject. 

SUMMARY AND DESCRIPTION OF THE INVENTION 

20 The present invention provides for modified forms of human interferon a, and 
specifically the interferon a2 type, herein called "INFa2 n , in which the immune 
characteristic is modified by means of reduced or removed numbers of potential T-cell 
epitopes. 

The invention discloses sequences identified within the INFa2 primary sequence that are 
25 potential T-cell epitopes by virtue of MHC class II binding potential. This disclosure 

specifically pertains the human INFa2 protein being 165 amino acid residues. 

The invention discloses also specific positions within the primary sequence of the 

molecule which according to the invention are to be altered by specific amino acid 

substitution, addition or deletion without in principal affecting the biological activity. In 
30 cases in which the loss of immunogenicity can be achieved only by a simultaneous loss of 

biological activity it is possible to restore said activity by further alterations within the 

amino acid sequence of the protein. 
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The invention furthermore discloses methods to produce such modified molecules, and 
above all methods to identify said T-cell epitopes which require alteration in order to 
reduce or remove immunogenic sites. 

The protein according to this invention would expect to display an increased circulation 
5 time within the human subject and would be of particular benefit in chronic or recurring 
disease settings such as is the case for a number of indications for ESTFa2. The present 
invention provides for modified forms of INFa2 proteins that are expected to display 
enhanced properties in vivo. These modified INFoc2 molecules can be used in 
pharmaceutical compositions. 

10 

In summary the invention relates to the following issues: 

• a modified molecule having the biological activity of human interferon alpha 2 
(INFa2) and being substantially non-immunogenic or less immunogenic than any non- 
modified molecule having the same biological activity when used in vivo\ 

15 • a corresponding molecule, wherein said loss of immunogenicity is achieved by 
removing one or more T-cell epitopes, preferably one T-cell epitope, derived from the 
originally non-modified molecule and / or by reduction in numbers of MHC allotypes 
able to bind peptides derived from said molecule; 

• a corresponding molecule, wherein said originally present T-cell epitopes are MHC 
20 class II ligands or peptide sequences which show the ability to stimulate or bind T-cells 

via presentation on MHC class II; 

• a corresponding molecule, wherein said ligands or peptide sequences are 13mer or 
15mer peptides; 

• a correspondingly molecule, wherein said peptide sequences are selected from the 
25 group as depicted in Figure 1. 

• a corresponding molecule, wherein 1-9 amino acid residues, preferably one amino 
acid residue, in any of the originally present T-cell epitopes are altered; 

• a corresponding molecule, wherein the alteration of the amino acid residues is 
substitution, addition or deletion, preferably substitution, of originally present amino 

30 acid(s) residue(s) by other amino acid residue(s) at specific position(s); 

• a corresponding molecule, wherein one or more of the amino acid residue substitutions 
are made as indicated in Figure 2, and, in addition, optionally one or more of the amino 
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acid residue substitutions are carried out as indicated in Figure 3 for the reduction in the 
number of MHC allotypes able to bind peptides derived from said molecule; 

• a corresponding molecule, wherein additionally further alteration, such as substitution, 
addition or deletion is conducted to restore biological activity of said molecule; 

5 • a corresponding modified molecule, wherein the amino acid alteration is made with 
reference to an homologous protein sequence or with reference to in silico modeling 
techniques; 

• a modified molecule having the biological activity of human interferon alpha 2 
(INFoc2) and being substantially non-immuno genie or less immunogenic than any non- 
10 modified molecule having the same biological activity when used in vivo, obtainable by 

alteration of one or more amino acids in the primary sequence by (i) removing one or 
more T-cell epitopes derived from the originally non-modified molecule and being MHC 
class II ligands or peptide sequences which show the ability to stimulate or bind T-cells 
via presentation on MHC class II, and / or (ii) by reduction in numbers of MHC allotypes 
15 able to bind peptides derived from said molecule, wherein said modified molecule 

comprises alterations which are made at one or more positions within following strings of 
contiguous amino acid residues of said primary sequence derived from the INFO 2 wild- 
type: 

(a) I SLF S CLKDRHDFGF PQEEFGNQFQKAETI PVLH (Rl), 
20 (b) FNLFSTKDSSAAWDE (R2), 

(c) KEDSILAVRKYFQRITLY (R3); 

• a corresponding molecule, wherein said alteration is substitution of 1 - 9 amino acid 
residues; 

• a corresponding molecule, wherein said substitution is conducted at one or more amino 
25 acid residues from the strings Rl, R2 and R3, preferably R2 and R3, and more preferably 

R3; 

• a corresponding molecule, wherein additionally one or more substitutions of amino 
acid residues outside the sequence strings Rl, R2 or R3 are conducted; 

• a corresponding molecule comprising an amino acid residue substitution made at one 
30 or more positions in the wild-type molecule: 24, 26, 27, 38, 55, 63, 64, 66, 67, 76, 84, 85, 

89, 103, 110, 111, 116, 117, 119, 122, 123, 126, 128, 129, 130, 153, preferably at one or 
more of the following positions 26, 27, 38, 63, 85, 89, 103, 110, 111, 116, 117, 122, 123, 
126, 128, 153, more preferably 103, 110, 111, 116, 117, 122, 123, 126, 128, 153; 
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• a preferred embodiment, wherein said substitution is made at one or more positions 
selected from 26, 27, 38 and additionally at one or more positions selected from 103, 110, 
111, 116, 117, 122, 123, 126, 128, 153, or alternatively, selected from 63, 85, 89 and 103, 
110, 111, 116, 117, 122, 123, 126, 128, 153; 

5 • a corresponding molecule, wherein said substitution is made at one or more positions 
as specified in Figure 4; 

• a corresponding molecule, wherein said substitution is made at positions 

L26P, F27S, F38E and / or I63T, Y85S, Y89D, Y89E, Y89N and / or V103E, LI 10G, 
Mil IT, M111S, Ml HE, I116S, I116Q, L117G, L117A, Y122E, Y122Q, F123H, I126A, 
10 L128A,L153S; 

• a preferred corresponding molecule wherein said substitution is made at positions 
L26P, F27S, F38E and / or I63T, Y85S, Y89D, Y89E, Y89N and / or V103E, LI 10G, 
M111T, M111S, M111E, I116S, I116Q, L117G, L117A; 

• a modified molecule having the biological activity of human interferon alpha 2 

15 (INFa2) and being substantially non-immunogenic or less immunogenic than any non- 
modified molecule having the same biological activity when used in vivo, obtainable by 
substitution of one or more amino acids in the primary sequence by (i) removing one or 
more T-cell epitopes derived from the originally non-modified molecule and being MHC 
class II ligands or peptide sequences which show the ability to stimulate or bind T-cells 

20 via presentation on MHC class II, and / or (ii) by reduction in numbers of MHC allotypes 
able to bind peptides derived from said molecule, wherein said substitution is made at one 
or more positions in a wild-type molecule INFa2a or INFa2b corresponding to at least 
one of the groups selected from: 
(i) I24P,L26P,F27S,F38E, V55A, 

25 (ii) I63T, L66A, F67D, F67E, W76H, F84D, F84E, Y85S, Y89D, Y89E, Y89N, 
(iii) any position within sequence R3; 

• a corresponding molecule, whereby one or more of the following substitutions are 
made within sequence R3: V103E, L110G, L110S, Mil IT, Mil IS, Mil IE, J116S, 
I116Q, L117G, LI 17 A, VI 19 A, Y122Q, Y122E, Y122H, F123H, I126A, L128A, 

30 Y129N, L130G, L130, L153S, preferably Y122E, Y122Q, F123H, I126A, L128A, 

optionally containing additional amino acid residue alterations, preferably substitutions, 
which lead to a further diminished immunogenicity; 
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• a modified human interferon alpha 2 (INFa2) having reduced immunogenicity 
consisting of the following sequence: 

CDLPQTHSLGSRRTLMLLAQMRX°ISLFSCLKDRHDFGFPQEEFGNQFQKAETIPVLHEMIQQIFN 
LFSTKDSSAAWDETLLDKFYTELYQQLND^ 
5 YLKEKKY S PC AWE WRAE IMRS F S L S TNLQE S LRS KE , 
wherein X° is R, K; 

XMsY,E,Q; 

X 2 is F, H; 

X 3 is I, A; and 
10 X 4 is L, A; 

whereby simultaneously X 1 = Y, X 2 = F, X 3 = I and X 4 = L are excluded (this sequence 
corresponds to the wild-type IFNa2); ; j 

• a modified human interferon alpha 2 (INFa2) having reduced immunogenicity 

consisting of the following sequence: 
15 CDLPQTHSLGSRRTLMLLAQMRX°IS 

LFSTKDSSAAWDETLLDKFYTELYQQLND^ 

X 8 YLKEKKYS PCAWEVVRAEIMRSF SX 9 STNLQ ESLRSKE , 

wherein X° is R, K; 

X 1 is L, S, G, 
20 X 2 is M, T, S, E, 

X 3 is I, S, Q, 

X 4 is L, G, 

X 5 is Y, E, Q; 

X 6 is F, H; 
25 X 7 is I, A; 

X 8 is L, A; and, 

X 9 is L, S 

whereby simultaneously X 1 = L, X 2 = M, X 3 = I, X 4 = L, X 5 = Y, X 6 = F, X 7 = I, 

X 8 = L and X 9 = L are excluded; 

30 • a modified human interferon alpha 2 (INFa2) having reduced immunogenicity 

consisting of the following sequence: 
CDLPQTHSLGSRRTLMLLAQMR^^ 
NX 3 X*STMDSSAAX 5 DETLLDKX 6 X 7 TELX a QQ 
TLYLKEKKYSPCAWEWRAEIMRSFSLSTJWLQESLRSKE, 

35 wherein X° is R, K; 
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X 1 is I,T; 
X 2 is F, D, A; 
X 3 is L,A; 
X 4 is F,D,E; 
5 X 5 is W,H; 

X 6 is F, D, E; 
X 7 is Y, S and 
X 8 is Y, D, E, N; 

whereby simultaneously X 1 = I, X 2 = F, X 3 = L, X 4 = F, X s = W, X 6 = F, X 7 = Y and X 8 = 
10 Y are excluded; 

• a modified human interferon alpha 2 (INFa2) having reduced immunogenicity 
consisting of the following sequence: 

CDLPQTHSLGSRRTLMLLAQMRX°ISLFSCLKDRHDFGFPQEEFGNQFQKAETIPVLHEMIQQX 1 F 
NLFSTKDSSAAWDETLLDKFX J TELX 3 QQLNDLEACVIQGVGVTETPLMKEDSILAVRKYFQRITL 
15 YLKEKKYSPCAWEWPvAEIMRSFSLSTNLQESLRSKE , 

wherein X° is R, K; 
X 1 is I,T; 
X 2 is Y, S and 
X 3 is Y, D, E, N; 

20 whereby simultaneously X 1 = I, X 2 = Y and X 3 = Y are excluded; 

• a modified human interferon alpha 2 (ENFa2) having reduced immunogenicity 
consisting of the following sequence: 

CDLPQTHSLGSRRTLMLLAQMRX°ISX 1 X 3 SCLKDRHDFGX 3 PQEEFGNQFQKAETIPX 4 LHEMIQQ 
IFNLFSTKDSSAAWDETLLDKFYTELYQQLMDLEACVIQGVGVTETPLMKEDSILAVRKYFQRIT 
25 LYLKEKKYSPCAWEWRAEIMRSFSLSTNLQESLRSKE , 
wherein X° is R, K; 
X 1 is; L,P, 
X 2 is;F,S, 
X 3 is F, E and 
30 X 4 is; V, A 

whereby simultaneously X 1 = L, X 2 = F, X 3 = F and X 4 = V are excluded; it should be 
pointed out that all exclusion specified in the above formulas refer to and shall include all 
non-deimmunized versions of IFNcc2 which are known in the prior art; 
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• corresponding modified INFoc2 sequences, wherein additional substitutions are 
made especially by the combinations as indicated in the claims; 

• a corresponding modified INFa2 sequence, wherein additional substitutions are made, 
preferably at one ore more positions within partial sequence Rl and / or R2 and /or R3 

5 (whereby Rl, R2, R3 are defined as indicated above); 

• a DNA sequence coding for a modified INFa2 as described above and below; 

• a pharmaceutical composition comprising a modified molecule having the biological 
activity of INFcc2 as defined above, optionally together with a pharmaceutic ally 
acceptable carrier, diluent or excipient; 

10 • a method for manufacturing a modified molecule having the biological activity of 
INFoc2 as defined above and below comprising the following steps: (i) determining the 
amino acid sequence of the polypeptide or part thereof, 

(ii) identifying one or more potential T-cell epitopes within the amino acid sequence of 
the protein by any method including determination of the binding of the peptides to MHC 

15 molecules using in vitro or in silico techniques or biological assays, (iii) designing new 
sequence variants with one or more amino acids within the identified potential T-cell 
epitopes modified in such a way to substantially reduce or eliminate the activity of the T- 
cell epitope as determined by the binding of the peptides to MHC molecules using in vitro 
or in silico techniques or biological assays ,or by binding of peptide-MHC complexes to 

20 T-ceils, (iv) constructing such sequence variants by recombinant DNA techniques and 
testing said variants in order to identify one or more variants with desirable properties, 
and (v) optionally repeating steps (ii) - (iv); 

• a corresponding method, wherein step (iii) is carried out by substitution, addition or 
deletion of 1 - 9 amino acid residues in any of the originally present T-cell epitopes or 

25 with reference to an homologous protein sequence and / or in silico modeling techniques; 

• a corresponding method, wherein step (ii) is carried out by the following steps: (a) 
selecting a region of the peptide having a known amino acid residue sequence; (b) 
sequentially sampling overlapping amino acid residue segments of predetermined uniform 
size and constituted by at least three amino acid residues from the selected region; (c) 

30 calculating MHC Class II molecule binding score for each said sampled segment by 

summing assigned values for each hydrophobic amino acid residue side chain present in 
said sampled amino acid residue segment; and (d) identifying at least one of said 
segments suitable for modification, based on the calculated MHC Class II molecule 
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binding score for that segment, to change overall MHC Class II binding score for the 
peptide without substantially the reducing therapeutic utility of the peptide; 

• a corresponding method, wherein step (c) is carried out by using a B5hm scoring 
function modified to include 12-6 van der Waal's ligand-protein energy repulsive term 

5 and ligand conformational energy term by (1) providing a first data base of MHC Class EL 
molecule models; (2) providing a second data base of allowed peptide backbones for said 
MHC Class II molecule models; (3) selecting a model from said first data base; (4) 
selecting an allowed peptide backbone from said second data base; (5) identifying amino 
acid residue side chains present in each sampled segment; (6) determining the binding 

10 affinity value for all side chains present in each sampled segment; and repeating steps (1) 
through (5) for each said model and each said backbone; 

• a peptide molecule, which is a T-cell epitope, consisting of 13 consecutive amino acid 
residues having a potential MHC class II binding activity and created from the primary 
sequence of non-modified INFa2 , selected from the group as depicted in Figure 1, Figure 

15 6a-c; 

• a peptide molecule consisting of 15, preferably at least 9, consecutive amino acid 
residues having a potential MHC class II binding activity and created from the primary 
sequence of non-modified INFa2, selected from any of the groups of partial sequences 
Rl, R2, R3 or selected from Figure 7; 

20 • a peptide molecule consisting of 9 - 15 consecutive amino acid residues, having a 
potential MHC class II binding activity and created from the primary sequence of non- 
modified ESfFoc2, whereby said molecule has a stimulation index of at least 1.8, preferably 
L8 - 2, more preferably > 2, in a biological assay of cellular proliferation, wherein said 
index is taken as the value of cellular proliferation scored following stimulation by a 

25 peptide and divided by the value of cellular proliferation scored in control cells not in 
receipt peptide and wherein cellular proliferation is measured by any suitable means 
according to standard methods as described in more detail in the Examples; 

• a corresponding peptide molecule having such stimulation index value consisting of 
any of the sequence as indicated in Figure 6 or 7; 

30 • a corresponding peptide molecule having such stimulation index value comprising at 
least 9 consecutive amino acid residues from any of the INFa2 partial sequences Rl, R2, 
R3 as defined in claim above; 
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• a use of a corresponding peptide, for the manufacture of INFoc2 having substantially no 
or less immunogenicity than any non-modified molecule with the same or acceptably 
reduced degree of biological activity when used in vivo; 

• a pharmaceutical composition consisting of a synthetic peptide sequence as specified 
5 above and below and in the Figures having the biological activity of IFNa2, optionally 

together with a pharmaceutically acceptable carrier, diluent or excipient. 

The term "T-cell epitope" means according to the understanding of this invention an 
amino acid sequence which is able to bind MHC class II, able to stimulate T-cells and / or 
10 also to bind (without necessarily measurably activating) T-cells in complex with MHC 
class II. 

The term "peptide" as used herein and in the appended claims, is a compound that 
includes two or more amino acids. The amino acids are linked together by a peptide bond 
(defined herein below). There are 20 different naturally occurring amino acids involved 

15 in the biological production of peptides, and any number of them may be linked in any 
order to form a peptide chain or ring. The naturally occurring amino acids employed in 
the biological production of peptides all have the L-configuration. Synthetic peptides can 
be prepared employing conventional synthetic methods, utilizing L-amino acids, D-amino 
acids, or various combinations of amino acids of the two different configurations. Some 

20 peptides contain only a few amino acid units. Short peptides, e.g., having less than ten 
amino acid units, are sometimes referred to as " oligopeptides". Other peptides contain a 
large number of amino acid residues, e.g. up to 100 or more, and are referred to as 
"polypeptides". By convention, a "polypeptide" may be considered as any peptide chain 
containing three or more amino acids, whereas a "oligopeptide" is usually considered as a 

25 particular type of "short" polypeptide. Thus, as used herein, it is understood that any 
reference to a "polypeptide" also includes an oligopeptide. Further, any reference to a 
"peptide" includes polypeptides, oligopeptides, and proteins. Each different arrangement 
of amino acids forms different polypeptides or proteins. The number of polypeptides-and 
hence the number of different proteins-that can be formed is practically unlimited. 

30 "Alpha carbon (Ccc)" is the carbon atom of the carbon-hydrogen (CH) component that is 
in the peptide chain. A "side chain" is a pendant group to Ca that can comprise a simple 
or complex group or moiety, having physical dimensions that can vary significantly 
compared to the dimensions of the peptide. 
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The invention may be applied to any INFa2 species of molecule with substantially the 
same primary amino acid sequences as those disclosed herein and would include therefore 
INFa2 molecules derived by genetic engineering means or other processes and may 
contain more or less than 165 amino acid residues. 

5 INFot2 proteins such as identified from other mammalian sources have in common many 
of the peptide sequences of the present disclosure and have in common many peptide 
sequences with substantially the same sequence as those of the disclosed listing. Such 
protein sequences equally therefore fall under the scope of the present invention. 
The invention is conceived to overcome the practical reality that soluble proteins 

10 introduced into autologous organisms can trigger an immune response resulting in 

development of host antibodies that bind to the soluble protein. A prominent example of 
this phenomenon amongst others, is the clinical use INFa2. A significant proportion of 
human patients treated with INFcc2 make antibodies despite the fact that this protein is 
produced endogenously [Russo, D. et al (1996) ibid; Stein, R. et al (1988) ibid\. The 

15 present invention seeks to address this by providing INFa2proteins with altered 

propensity to elicit an immune response on administration to the human host. According 
to the methods described herein, the inventors have discovered and now disclose the 
regions of the INFa2 molecule comprising the critical T-cell epitopes driving the immune 
responses to this autologous protein. 

20 

The general method of the present invention leading to the modified INFa2 comprises the 
following steps: 

(a) determining the amino acid sequence of the polypeptide or part thereof; 

(b) identifying one or more potential T-cell epitopes within the amino acid sequence of 
25 the protein by any method including determination of the binding of the peptides to MHC 

molecules using in vitro or in silico techniques or biological assays; 

(c) designing new sequence variants with one or more amino acids within the identified 
potential T-cell epitopes modified in such a way to substantially reduce or eliminate the 
activity of the T-cell epitope as determined by the binding of the peptides to MHC 

30 molecules using in vitro or in silico techniques or biological assays. Such sequence 
variants are created in such a way to avoid creation of new potential T-cell epitopes by 
the sequence variations unless such new potential T-cell epitopes are, in turn, modified in 
such a way to substantially reduce or eliminate the activity of the T-cell epitope; and 
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(d) constructing such sequence variants by recombinant DNA techniques and testing said 
variants in order to identify one or more variants with desirable properties according to 
well known recombinant techniques. 

5 The identification of potential T-cell epitopes according to step (b) can be carried out 
according to methods describes previously in the prior art. Suitable methods are 
disclosed in WO 98/59244; WO 98/52976; WO 00/34317 and may preferably be used to 
identify binding propensity of INFa2a -derived peptides to an MHC class II molecule. 

10 Another very efficacious method for identifying T-cell epitopes by calculation is 
described in the EXAMPLE 1 which is a preferred embodiment according to this 
invention. 

In practice a number of variant INFa2 proteins will be produced and tested for the desired 
15 immune and functional characteristic. The variant proteins will most preferably be 

produced by recombinant DNA techniques although other procedures including chemical 
synthesis of INFa2 fragments may be contemplated. 

The results of an analysis according to step (b) of the above scheme and pertaining to the 
human DSFFa2a protein sequence of 165 amino acid residues is presented in Figure 1. 
20 The results of a design and constructs according to step (c) and (d) of the above scheme 
and pertaining to the modified molecule of this invention is presented in Figures 2 and 3. 

The invention relates to INFa2 analogues in which substitutions of at least one amino 
acid residue have been made at positions resulting in a substantial reduction in activity of 
25 or elimination of one or more potential T-cell epitopes from the protein. One or more 
amino acid substitutions at particular points within any of the potential MHC class II 
ligands identified in Table 1 may result in a INFa2 molecule with a reduced 
immunogenic potential when administered as a therapeutic to the human host. 

30 It is most preferred to provide an INFoc2 molecule in which amino acid modification (e.g. 
a substitution) is conducted within the most immunogenic regions of the parent molecule. 
The inventors herein have discovered that the most immunogenic regions of the INFcc2 
molecule in man are confined to three regions. Rl, R2 and R3 comprising respectively 
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amino acid sequences; ISLFSCLKDRHDFGFPQEEFGNQFQKAETIPVLH; 
FNLFSTKDSSAAWDE and KEDSILAVRKYFQRITLY. The major preferred 
embodiments of the present invention comprise INFa2 molecules for which the MHC 
class II ligands of Figure 1 and which align either in their entirety or to a minimum of 9 
5 amino acid residues with any of the above sequence elements Rl, R2 or R3 are altered 
such as to eliminate binding or otherwise reduce the numbers of MHC allotypes to which 
the peptide can bind. 

The preferred embodiments of the invention include the specific substitutions of Figure 4. 
10 It is particularly preferred to provide modified DSfFa2 molecules containing combinations 
of substitutions from Figure 4. Combinations which comprise modification to each of the 
immunogenic regions Rl, R2 and R3 are preferred, and combinations comprising 
modifications to R2 and R3 are especially preferred although such preference is not 
intended to limit the combinations of substitution which are considered desirable. 

15 

For the elimination of T-cell epitopes, amino acid substitutions are preferably made at 
appropriate points within the peptide sequence predicted to achieve substantial reduction 
or elimination of the activity of the T-cell epitope. In practice an appropriate point will 
preferably equate to an amino acid residue binding within one of the pockets provided 

20 within the MHC class II binding groove. 

It is most preferred to alter binding within the first pocket of the cleft at the so-called PI 
or PI anchor position of the peptide. The quality of binding interaction between the PI 
anchor residue of the peptide and the first pocket of the MHC class II binding groove is 
recognized as being a major determinant of overall binding affinity for the whole peptide. 

25 An appropriate substitution at this position of the peptide will be for a residue less readily 
accommodated within the pocket, for example, substitution to a more hydrophilic residue. 
Amino acid residues in the peptide at positions equating to binding within other pocket 
regions within the MHC binding cleft are also considered and fall under the scope of the 
present. 

30 

It is understood that single amino acid substitutions within a given potential T-cell epitope 
are the most preferred route by which the epitope may be eliminated. Combinations of 
substitution within a single epitope may be contemplated and for example can be 
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particularly appropriate where individually defined epitopes are in overlap with each 
other. Moreover, amino acid substitutions either singly within a given epitope or in 
combination within a single epitope may be made at positions not equating to the "pocket 
residues" with respect to the MHC class II binding groove, but at any point within the 
5 peptide sequence. Substitutions may be made with reference to an homologues structure 
or structural method produced using in silico techniques known in the art and may be 
based on known structural features of the molecule according to this invention. All such 
substitutions fall within the scope of the present invention. 

10 Amino acid substitutions other than within the peptides identified above may be 

contemplated particularly when made in combination with substitution(s) made within a 
listed peptide. For example a change may be contemplated to restore structure or 
biological activity of the variant molecule. Such compensatory changes and changes to 
include deletion or addition of particular amino acid residues from the INFa2 polypeptide 

15 resulting in a variant with desired activity and in combination with changes in any of the 
disclosed peptides fall under the scope of the present. 

In as far as this invention relates to modified INFoc2, compositions containing such 
modified BSfFa2 proteins or fragments of modified INFa2 proteins and related 
compositions should be considered within the scope of the invention. In another aspect, 
20 the present invention relates to nucleic acids encoding modified INFa2 entities. In a 
further aspect the present invention relates to methods for therapeutic treatment of 
humans using the modified INFoc2 proteins. 

SHORT DESCRIPTION OF THE FIGURES 

25 The invention will now be illustrated, but not limited, by the following examples. The 
examples refer to the following drawings. The amino acid residues are consequently 
depicted as one-letter code. 

Figure 1 provides peptide sequences in human INFo2a with potential human MHC class 
30 II binding activity. 

Figure 2 provides substitutions leading to the elimination of T-cell epitopes of human 
INPa2a (WT = wild-type residue). 
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Figure 3 provides additional substitutions leading to the removal of a potential T-cell 
epitope for one or more MHC allotypes. 

Figure 4 provides preferred substitutions in human INFa2a (WT = wild-type residue, 
5 MUT = desired residue). 

Figure 5 provides a table of the INFa2 13-mer synthetic peptides sequences analysed 
using an MHC class II in vitro binding assay of EXAMPLE 2. 

10 Figure 6 shows the results of in vitro MHC peptide binding assays for MHC allotypes, a) 
indicates peptides with high affinity binding (0% inhibition by competitor reference 
peptide) for each of the MHC allotypes tested; b) indicates peptides with medium affinity 
(0-50% inhibition by competitor) binding for each of the MHC allotypes tested; c) 
indicates peptides with low (50-100% inhibition by competitor) affinity binding for each 

15 of the MHC allotypes tested and d) indicates peptides with no detectable binding to the 
MHC allotypes tested. 

Figure 7 provides a table of the INFoc2 15-mer peptide sequences analysed using the 
naive human in vitro T-cell assay of EXAMPLE 3. The peptide ID# and position of the 
20 N-terminal peptide residue within the INFoc2 sequence is indictated. 

Figure 8 shows cumulative stimulation indexes from 6 individuals that respond to 
stimulation with IFNcc peptides. Six donors from 20 screened responded to stimulation 
with one or more of 51 15mer peptides from the IFNa sequence. Responses to individual 
25 peptides are grouped into three distinct regions with region three containing the most 
immunogenic peptides #38 and #39 (arrows). Control peptides C32 (DRB1 -restricted) 
and C49 (DP-restricted) are included for comparison. Cross-hatching within each bar 
indicates the contribution from individual donors. 

30 Figure 9 shows the immunogenic regions within INFa and details the peptide sequences 
from these regions able to stimulate naive human T-cells. 

Figure 10 provides a table indicating INFa peptides capable of promoting proliferation of 
naive human T-cells in vitro. For 5 of the donors, responses are recorded to multiple 
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overlapping peptides from the major epitope regions Rl, R2 and R3. For 3 of the donors, . 
responses are recorded to individual synthetic peptides from Rl, R2 or R3. 

Figure 11 provides a table showing frequency of MHC class II alleles in the responding 
5 and non-responding donors to EFNoc peptides, a = Numerator is number of donors with 
DR allele, denominator is number of donors that showed T cell proliferation in vitro to 
that peptide (total number of responding donors = 6). b = Frequency of allele in donor 
population. Peptides for which two or fewer responses were recorded were not evaluated. 
All responding donors tested negative for DRB1*14. The DRB1*14 allotype has a 
10 frequency of 1.5% in the 20 donors tested. Allorestriction of a given peptide is 

determined by the frequency of an allele in the donor population and the number of 
responding donors that express the same allele. If a peptide is associated with any 
particular allele (allorestricted) then the percentage shown would be expected to be 
greater than the frequency for the allele in the population. 

15 

Figure 12 provides tables of IC 50 values for 15-mer synthetic peptides in competition 
binding assay for particular MHC class II allotypes. 

(a) Competition MHC class II peptide binding assay to determine relative binding 
affinities of INFa peptides capable of promoting proliferation of naive human T-cells in 

20 vitro to DRB1*0101. INFa peptides were incubated with fixed HOM-2 cells in the 

presence of 10 \iM biotinylated influenza haemagglutinin 307-319. The concentration of 
competitor peptide causing 50% inhibition of maximun biotinylated peptide binding was 
taken as the IC 50 . Influenza 103-115 was included as a high affinity control. IC50 <20 \\M 
= high affinity, IC 5 o = 20-100 |jM = Medium Affinity, IC 50 >100 uM = Low Affinity. 

25 (b) Competition MHC class II peptide binding assay to determine relative binding 

affinities of INFa peptides capable of promoting proliferation of naive human T-cells in 
vitro to DRB 1*0701. INFa peptides were incubated with fixed MOU (MANN) cells in 
the presence of lOuM biotinylated tetanus toxin 828-840. The concentration of competitor 
peptide causing 50% inhibition of maximun biotinylated peptide binding was taken as the 

30 IC 5 o. Tetanus toxin 828-840 was included as a high affinity control. IC 50 <20 yM = high 
affinity, IC 50 = 20-100 jaM = Medium Affinity, IC 50 >100 jjM = Low Affinity, 
(c) Competition MHC class II peptide binding assay to determine relative binding 
affinities of INFa peptides capable of promoting proliferation of naive human T-cells in 
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vitro to DRB 1*0401. INFa peptides were incubated with fixed WT-51 cells in the 
presence of 50 uM biotinylated influenza haemagglutinin 307-319. The concentration of 
competitor peptide causing 50% inhibition of maximun biotinylated peptide binding was 
taken as the IC 50 . Influenza 103-115 was included as a high affinity control. IC50 <20 nM 
5 = high affinity, IC 50 = 20-100 jjM = Medium Affinity, IC 50 >100 |iM = Low Affinity. 

Figure 13 provides a table detailing substitutions within INFa which provide molecules 
with retained activity in the anti-proliferation assay of EXAMPLE 7. 
WT = wild-type residue; # = residue number; Mut = mutation conducted. Epitope Region 
10 indicates location of substitution with respect to immunogenic epitope regions Rl, R2 or 
R3. 

Figure 14 provides representative data of the anti-proliferative effect of selected mutant 
ESfFa2 molecules. Assays were conducted according to the methods of EXAMPLE 7. 
15 Panel a) shows activity of molecules with substitution within immunogenic epitope Rl. 
Panel b) shows activity of molecules with substitution within immunogenic epitope R2. 
Panel c) shows activity of molecules with substitution within immunogenic epitope R3. 

Figure 15 provides panels that show individual donor responses to INFa synthetic 
20 peptides. Data from control peptides C32 (DRBl-restricted) and C49 (DP-restricted) are 
included for comparison. Immunogenic regions Rl, R2, R3 are indicated on relevant 
panels. Threshold for positive stimulation index = 2. 

25 In the following Examples the invention is described in more detail which shall not be 
interpreted as a limitation or restriction. 

EXAMPLE 1 

There are a number of factors that play important roles in determining the total structure 
30 of a protein or polypeptide. First, the peptide bond, i.e., that bond which joins the amino 
acids in the chain together, is a covalent bond. This bond is planar in structure, 
essentially a substituted amide. An "amide" is any of a group of organic compounds 
containing the grouping -CONH-. 
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The planar peptide bond linking Ca of adjacent amino acids may be represented as 
depicted below: 



Because the 0=C and the C-N atoms lie in a relatively rigid plane, free rotation does not 

5 occur about these axes. Hence, a plane schematically depicted by the interrupted line is 
sometimes referred to as an "amide" or "peptide plane" plane wherein lie the oxygen (O), 
carbon (C), nitrogen (N), and hydrogen (H) atoms of the peptide backbone. At opposite 
corners of this amide plane are located the Ca atoms. Since there is substantially no 
rotation about the 0=C and C-N atoms in the peptide or amide plane, a polypeptide chain 

10 thus comprises a series of planar peptide linkages joining the Ca atoms. 
A second factor that plays an important role in defining the total structure or 
conformation of a polypeptide or protein is the angle of rotation of each amide plane 
about the common Ca linkage. The terms "angle of rotation" and "torsion angle" are 
hereinafter regarded as equivalent terms. Assuming that the O, C, N, and H atoms remain 

15 in the amide plane (which is usually a valid assumption, although there may be some 
slight deviations from planarity of these atoms for some conformations), these angles of 
rotation define the N and R polypeptide's backbone conformation, i.e., the structure as it 
exists between adjacent residues. These two angles are known as § andty. A set of the 
angles <t>i, where the subscript represents a particular residue of a polypeptide chain, 

20 thus effectively defines the polypeptide secondary structure. The conventions used in 
defining the (J), y angles, i.e., the reference points at which the amide planes form a zero 
degree angle, and the definition of which angle is 4>, and which angle is y, for a given 
polypeptide, are defined in the literature (see, e.g., Ramachandran et al. Adv. Prot Chem. 
23:283-437 (1968), at pages 285-94, which pages are incorporated herein by reference). 

25 

The present method can be applied to any protein, and is based in part upon the discovery 
that in humans the primary Pocket 1 anchor position of MHC Class II molecule binding 
grooves has a well designed specificity for particular amino acid side chains. The 


WO 02/085941 


- 23 - 


PCT/EP02/02218 


specificity of this pocket is determined by the identity of the amino acid at position 86 of 
the beta chain of the MHC Class II molecule. This site is located at the bottom of Pocket 
1 and determines the size of the side chain that can be accommodated by this pocket. 
Marshall, K.W., J. Immunol, 152:4946-4956 (1994). If this residue is a glycine, then all 

5 hydrophobic aliphatic and aromatic amino acids (hydrophobic aliphatics being: valine, 
leucine, isoleucine, methionine and aromatics being: phenylalanine, tyrosine and 
tryptophan) can be accommodated in the pocket, a preference being for the aromatic side 
chains. If this pocket residue is a valine, then the side chain of this amino acid protrudes 
into the pocket and restricts the size of peptide side chains that can be accommodated 

10 such that only hydrophobic aliphatic side chains can be accommodated. Therefore, in an 
amino acid residue sequence, wherever an amino acid with a hydrophobic aliphatic or 
aromatic side chain is found, there is the potential for a MHC Class II restricted T-cell 
epitope to be present. If the side-chain is hydrophobic aliphatic, however, it is 
approximately twice as likely to be associated with a T-cell epitope than an aromatic side 

15 chain (assuming an approximately even distribution of Pocket 1 types throughout the 
global population). 

A computational method embodying the present invention profiles the likelihood of 
peptide regions to contain T-cell epitopes as follows: 

(1) The primary sequence of a peptide segment of predetermined length is scanned, and 
20 all hydrophobic aliphatic and aromatic side chains present are identified. (2)The 

hydrophobic aliphatic side chains are assigned a value greater than that for the aromatic 
side chains; preferably about twice the value assigned to the aromatic side chains, e.g., a 
value of 2 for a hydrophobic aliphatic side chain and a value of 1 for an aromatic side 
chain. (3) The values determined to be present are summed for each overlapping amino 
25 acid residue segment (window) of predetermined uniform length within the peptide, and 
the total value for a particular segment (window) is assigned to a single amino acid 
residue at an intermediate position of the segment (window), preferably to a residue at 
about the midpoint of the sampled segment (window). This procedure is repeated for 
each sampled overlapping amino acid residue segment (window). Thus, each amino acid 
30 residue of the peptide is assigned a value that relates to the likelihood of a T-cell epitope 
being present in that particular segment (window). (4) The values calculated and assigned 
as described in Step 3, above, can be plotted against the amino acid coordinates of the 
entire amino acid residue sequence being assessed. (5) All portions of the sequence which 
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have a score of a predetermined value, e.g., a value of 1, are deemed likely to contain a T- 
cell epitope and can be modified, if desired. 

This particular aspect of the present invention provides a general method by which the 
regions of peptides likely to contain T-cell epitopes can be described. Modifications to the 
5 peptide in these regions have the potential to modify the MHC Class II binding 
characteristics. 

According to another aspect of the present invention, T-cell epitopes can be predicted 
with greater accuracy by the use of a more sophisticated computational method which 
takes into account the interactions of peptides with models of MHC Class II alleles. 

10 The computational prediction of T-cell epitopes present within a peptide according to this 
particular aspect contemplates the construction of models of at least 42 MHC Class II 
alleles based upon the structures of all known MHC Class II molecules and a method for 
the use of these models in the computational identification of T-cell epitopes, the 
construction of libraries of peptide backbones for each model in order to allow for the 

15 known variability in relative peptide backbone alpha carbon (Coc) positions, the 

construction of libraries of amino-acid side chain conformations for each backbone dock 
with each model for each of the 20 amino-acid alternatives at positions critical for the 
interaction between peptide and MHC Class II molecule, and the use of these libraries of 
backbones and side-chain conformations in conjunction with a scoring function to select 

20 the optimum backbone and side-chain conformation for a particular peptide docked with a 
particular MHC Class II molecule and the derivation of a binding score from this 
interaction. 

Models of MHC Class II molecules can be derived via homology modeling from a 
number of similar structures found in the Brookhaven Protein Data Bank ("PDB"). These 

25 may be made by the use of semi-automatic homology modeling software (Modeller, Sali 
A. & Blundell TL., 1993. J. Mol Biol 234:779-815) which incorporates a simulated 
annealing function, in conjunction with the CHARMm force-field for energy 
minimisation (available from Molecular Simulations Inc., San Diego, Ca.). Alternative 
modeling methods can be utilized as well. 

30 The present method differs significantly from other computational methods which use 
libraries of experimentally derived binding data of each amino-acid alternative at each 
position in the binding groove for a small set of MHC Class II molecules (Marshall, 
K.W., et al, Biomed. Pept. Proteins Nucleic Acids, 1(3):157-162) (1995) or yet other 
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computational methods which use similar experimental binding data in order to define the 
binding characteristics of particular types of binding pockets within the groove, again 
using a relatively small subset of MHC Class II molecules, and then 'mixing and 
matching' pocket types from this pocket library to artificially create further 'virtual' 

5 MHC Class II molecules (Sturniolo T M et al., Nat. Biotech, 17(6): 555-561 (1999). Both 
prior methods suffer the major disadvantage that, due to the complexity of the assays and 
the need to synthesize large numbers of peptide variants, only a small number of MHC 
Class II molecules can be experimentally scanned. Therefore the first prior method can 
only make predictions for a small number of MHC Class II molecules. The second prior 

10 method also makes the assumption that a pocket lined with similar amino-acids in one 
molecule will have the same binding characteristics when in the context of a different 
Class II allele and suffers further disadvantages in that only those MHC Class II 
molecules can be 'virtually' created which contain pockets contained within the pocket 
library. Using the modeling approach described herein, the structure of any number and 

15 type of MHC Class II molecules can be deduced, therefore alleles can be specifically 
selected to be representative of the global population. In addition, the number of MHC 
Class II molecules scanned can be increased by making further models further than 
having to generate additional data via complex experimentation. 

The use of a backbone library allows for variation in the positions of the Ccc atoms of the 
20 various peptides being scanned when docked with particular MHC Class II molecules. 
This is again in contrast to the alternative prior computational methods described above 
which rely on the use of simplified peptide backbones for scanning amino-acid binding in 
particular pockets. These simplified backbones are not likely to be representative of 
backbone conformations found in 'real' peptides leading to inaccuracies in prediction of 
25 peptide binding. The present backbone library is created by superposing the backbones of 
all peptides bound to MHC Class II molecules found within the Protein Data Bank and 
noting the root mean square (RMS) deviation between the Ca atoms of each of the eleven 
amino-acids located within the binding groove. While this library can be derived from a 
small number of suitable available mouse and human structures (currently 13), in order to 
30 allow for the possibility of even greater variability, the RMS figure for each C"-oc position 
is increased by 50%. The average Ca position of each amino-acid is then determined and 
a sphere drawn around this point whose radius equals the RMS deviation at that position 
plus 50%. This sphere represents all allowed Ca positions. 
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Working from the Ca with the least RMS deviation (that of the amino-acid in Pocket 1 as 
mentioned above, equivalent to Position 2 of the 11 residues in the binding groove), the 
sphere is three-dimensionally gridded, and each vertex within the grid is then used as a 
possible location for a Ca of that amino-acid. The subsequent amide plane, corresponding 
5 to the peptide bond to the subsequent amino-acid is grafted onto each of these Cos and 
the <j) and \}/ angles are rotated step-wise at set intervals in order to position the subsequent 
Ca. If the subsequent Ca falls within the 'sphere of allowed positions' for this Ca than 
the orientation of the dipeptide is accepted, whereas if it falls outside the sphere then the 
dipeptide is rejected. 

10 This process is then repeated for each of the subsequent Ca positions, such that the 
peptide grows from the Pocket 1 Ca 'seed', until all nine subsequent Cos have been 
positioned from all possible permutations of the preceding Cos. The process is then 
repeated once more for the single Ca preceding pocket 1 to create a library of backbone 
Ca positions located within the binding groove. 

15 The number of backbones generated is dependent upon several factors: The size of the 
'spheres of allowed positions'; the fineness of the gridding of the 'primary sphere' at the 
Pocket 1 position; the fineness of the step-wise rotation of the § and \|/ angles used to 
position subsequent Cas. Using this process, a large library of backbones can be created. 
The larger the backbone library, the more likely it will be that the optimum fit will be 

20 found for a particular peptide within the binding groove of an MHC Class II molecule. 
Inasmuch as all backbones will not be suitable for docking with all the models of MHC 
Class II molecules due to clashes with amino-acids of the binding domains, for each allele 
a subset of the library is created comprising backbones which can be accommodated by 
that allele. 

25 The use of the backbone library, in conjunction with the models of MHC Class II 

molecules creates an exhaustive database consisting of allowed side chain conformations 
for each amino-acid in each position of the binding groove for each MHC Class II 
molecule docked with each allowed backbone. This data set is generated using a simple 
steric overlap function where a MHC Class II molecule is docked with a backbone and an 

30 amino-acid side chain is grafted onto the backbone at the desired position. Each of the 
rotatable bonds of the side chain is rotated step-wise at set intervals and the resultant 
positions of the atoms dependent upon that bond noted. The interaction of the atom with 
atoms of side-chains of the binding groove is noted and positions are either accepted or 
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rejected according to the following criteria: The sum total of the overlap of all atoms so 
far positioned must not exceed a pre-determined value. Thus the stringency of the 
conformational search is a function of the interval used in the step-wise rotation of the 
bond and the pre-determined limit for the total overlap. This latter value can be small if it 

5 is known that a particular pocket is rigid, however the stringency can be relaxed if the 
positions of pocket side-chains are known to be relatively flexible. Thus allowances can 
be made to imitate variations in flexibility within pockets of the binding groove. This 
conformational search is then repeated for every amino-acid at every position of each 
backbone when docked with each of the MHC Class II molecules to create the exhaustive 

10 database of side-chain conformations. 

A suitable mathematical expression is used to estimate the energy of binding between 
models of MHC Class II molecules in conjunction with peptide ligand conformations 
which have to be empirically derived by scanning the large database of backbone/side- 
chain conformations described above. Thus a protein is scanned for potential T-cell 

15 epitopes by subjecting each possible peptide of length varying between 9 and 20 amino- 
acids (although the length is kept constant for each scan) to the following computations: 
An MHC Class II molecule is selected together with a peptide backbone allowed for that 
molecule and the side-chains corresponding to the desired peptide sequence are grafted 
on. Atom identity and interatomic distance data relating to a particular side-chain at a 

20 particular position on the backbone are collected for each allowed conformation of that 
amino-acid (obtained from the database described above). This is repeated for each side- 
chain along the backbone and peptide scores derived using a scoring function. The best 
score for that backbone is retained and the process repeated for each allowed backbone 
for the selected model. The scores from all allowed backbones are compared and the 

25 highest score is deemed to be the peptide score for the desired peptide in that MHC Class 
II model. This process is then repeated for each model with every possible peptide 
derived from the protein being scanned, and the scores for peptides versus models are 
displayed. 

In the context of the present invention, each ligand presented for the binding affinity 
30 calculation is an amino-acid segment selected from a peptide or protein as discussed 
above. Thus, the ligand is a selected stretch of amino acids about 9 to 20 amino acids in 
length derived from a peptide, polypeptide or protein of known sequence. The terms 
"amino acids" and "residues" are hereinafter regarded as equivalent terms. 
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The ligand, in the form of. the consecutive amino acids of the peptide to be examined 
grafted onto a backbone from the backbone library, is positioned in the binding cleft of an 
MHC Class II molecule from the MHC Class II molecule model library via the 
coordinates of the C"-a atoms of the peptide backbone and an allowed conformation for 

5 each side-chain is selected from the database of allowed conformations. The relevant 
atom identities and interatomic distances are also retrieved from this database and used to 
calculate the peptide binding score. Ligands with a high binding affinity for the MHC 
Class II binding pocket are flagged as candidates for site-directed mutagenesis. Amino- 
acid substitutions are made in the flagged ligand (and hence in the protein of interest) 

10 which is then retested using the scoring function in order to determine changes which 
reduce the binding affinity below a predetermined threshold value. These changes can 
then be incorporated into the protein of interest to remove T-cell epitopes. 
Binding between the peptide ligand and the binding groove of MHC Class II molecules 
involves non-covalent interactions including, but not limited to: hydrogen bonds, 

15 electrostatic interactions, hydrophobic (lipophilic) interactions and Van der Walls 
interactions. These are included in the peptide scoring function as described in detail 
below. 

It should be understood that a hydrogen bond is a non-covalent bond which can be formed 
between polar or charged groups and consists of a hydrogen atom shared by two other 

20 atoms. The hydrogen of the hydrogen donor has a positive charge where the hydrogen 
acceptor has a partial negative charge. For the purposes of peptide/protein interactions, 
hydrogen bond donors may be either nitrogens with hydrogen attached or hydrogens 
attached to oxygen or nitrogen. Hydrogen bond acceptor atoms may be oxygens not 
attached to hydrogen, nitrogens with no hydrogens attached and one or two connections, 

25 or sulphurs with only one connection. Certain atoms, such as oxygens attached to 

hydrogens or imine nitrogens (e.g. C=NH) may be both hydrogen acceptors or donors. 
Hydrogen bond energies range from 3 to 7 Kcal/mol and are much stronger than Van der 
Waal's bonds, but weaker than covalent bonds. Hydrogen bonds are also highly 
directional and are at their strongest when the donor atom, hydrogen atom and acceptor 

30 atom are co-linear. 

Electrostatic bonds are formed between oppositely charged ion pairs and the strength of 
the interaction is inversely proportional to the square of the distance between the atoms 
according to Coulomb's law. The optimal distance between ion pairs is about 2.8 A. In 
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protein/peptide interactions, electrostatic bonds may be formed between arginine, 
histidine or lysine and aspartate or glutamate. The strength of the bond will depend upon 
the pKa of the ionizing group and the dielectric constant of the medium although they are 
approximately similar in strength to hydrogen bonds. 

5 Lipophilic interactions are favorable hydrophobic-hydrophobic contacts that occur 
between he protein and peptide ligand. Usually, these will occur between hydrophobic 
amino acid side chains of the peptide buried within the pockets of the binding groove 
such that they are not exposed to solvent. Exposure of the hydrophobic residues to solvent 
is highly unfavorable since the surrounding solvent molecules are forced to hydrogen 

10 bond with each other forming cage-like clathrate structures. The resultant decrease in 
entropy is highly unfavorable. Lipophilic atoms may be sulphurs which are neither polar 
nor hydrogen acceptors and carbon atoms which are not polar. 

Van der Waal's bonds are non-specific forces found between atoms which are 3-4A apart. 
They are weaker and less specific than hydrogen and electrostatic bonds. The distribution 

15 of electronic charge around an atom changes with time and, at any instant, the charge 
distribution is not symmetric. This transient asymmetry in electronic charge induces a 
similar asymmetry in neighboring atoms. The resultant attractive forces between atoms 
reaches a maximum at the Van der Waal's contact distance but diminishes very rapidly at 
about 1 A to about 2 A. Conversely, as atoms become separated by less than the contact 

20 distance, increasingly strong repulsive forces become dominant as the outer electron 

clouds of the atoms overlap. Although the attractive forces are relatively weak compared 
to electrostatic and hydrogen bonds (about 0.6 Kcal/mol), the repulsive forces in 
particular may be very important in determining whether a peptide ligand may bind 
successfully to a protein. 

25 In one embodiment, the Bohm scoring function (SCORE 1 approach) is used to estimate 
the binding constant. (Bohm, H.J., 7. Comput Aided Mol. Des., 8(3):243-256 (1994) 
which is hereby incorporated in its entirety). In another embodiment, the scoring function 
(SCORE2 approach) is used to estimate the binding affinities as an indicator of a ligand 
containing a T-cell epitope (Bohm, H.J., J. Comput Aided Mol Des., 12(4):309-323 

30 (1998) which is hereby incorporated in its entirety). However, the Bohm scoring 

functions as described in the above references are used to estimate the binding affinity of 
a ligand to a protein where it is already known that the ligand successfully binds to the 
protein and the protein/ligand complex has had its structure solved, the solved structure 
being present in the Protein Data Bank ("PDB")- Therefore, the scoring function has 
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been developed with the benefit of known positive binding data. In order to allow for 
discrimination between positive and negative binders, a repulsion term must be added to 
the equation. In addition, a more satisfactory estimate of binding energy is achieved by 
computing the lipophilic interactions in a pairwise manner rather than using the area 

5 based energy term of the above B6hm functions. 

Therefore, in a preferred embodiment, the binding energy is estimated using a modified 
Bohm scoring function. In the modified Bohm scoring function, the binding energy 
between protein and ligand (AGbind) is estimated considering the following parameters: 
The reduction of binding energy due to the overall loss of translational and rotational 

10 entropy of the ligand (AG 0 ); contributions from ideal hydrogen bonds (AG h b) where at 
least one partner is neutral; contributions from unperturbed ionic interactions (AQonic); 
lipophilic interactions between lipophilic ligand atoms and lipophilic acceptor atoms 
(AGiipo); the loss of binding energy due to the freezing of internal degrees of freedom in 
the ligand, i.e., the freedom of rotation about each C-C bond is reduced (AG rot ); the 

15 energy of the interaction between the protein and ligand (Evaw). Consideration of these 
terms gives equation 1 : 

(AGbind) = ( AG 0 ) + ( AGhbXNnbKC AGionicXN ionic ) + ( AGiip 0 xNiipo) + ( 

AG r o t +N r ot) + (E vdw) - 

Where N is the number of qualifying interactions for a specific term and, in one 
20 embodiment, AG 0 , AG h b, AGionic, AGii po and AG rot are constants which are given the 
values: 5.4, -4.7, -4.7, -0.17, and 1.4, respectively. 
The term N h b is calculated according to equation 2 : 
Nhb = Lh -bonds 

f(AR, Aa) X f (N n eighb) X J-pcs 

f(AR, Aa) is a penalty function which accounts for large deviations of hydrogen bonds 
25 from ideality and is calculated according to equation 3 : 
f(AR, A-oc) = fl(AR) x f2(Aoc) 
Where: fl(AR) = 1 if AR <= TOL 


or 


= 1 - (AR - TOD/0.4 if AR <= 0.4 + TOL 


or 


= 0 if AR >0 .4 + TOL 


30 And: 


f 2 (Aa) 


= 1 if Aa <30° 


or 


= l-( Aa - 30) /50 if Aa <=80° 


or 


= 0 if Aa >80° 
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TOL is the tolerated deviation in hydrogen bond length = 0.25 A 

AR is the deviation of the H-O/N hydrogen bond length from the ideal value = 1.9 A 

Ace is the deviation of the hydrogen bond angle Z n/o-h..o/n from its idealized value of 

180° 

5 f(N ne ighb) distinguishes between concave and convex parts of a protein surface and 
therefore assigns greater weight to polar interactions found in pockets rather than those 
found at the protein surface. This function is calculated according to equation 4 below: 
f(Nneighb) = (N nei ghb/Nneighb, o ) a where a = 0.5 

Nneighb is the number of non-hydrogen protein atoms that are closer than 5 A to any given 
10 protein atom. 

N a eighb,o is a constant = 25 

f pcs is a function which allows for the polar contact surface area per hydrogen bond and 

therefore distinguishes between strong and weak hydrogen bonds and its value is 

determined according to the following criteria: 
15 f pcs = S when Apoiar/N H B < 10 A 2 

or fp CS = 1 when A P oiar/N H B > 10 A 2 

-A^polar is the size of the polar protein-ligand contact surface 

Nhb is the number of hydrogen bonds 

B is a constant whose value =1.2 
20 For the implementation of the modified Bohm scoring function, the contributions from 

ionic interactions, AGi 0n ic, are computed in a similar fashion to those from hydrogen 

bonds described above since the same geometry dependency is assumed. 

The term Ni ipo is calculated according to equation 5 below: 

Nupo = SiLfC^iL) 

25 f(ri L ) is calculated for all lipophilic ligand atoms, 1, and all lipophilic protein atoms, L, 
according to the following criteria: 

f (n L ) =1 when r iL <= Rlf (r iL ) =(r iL - Rl)/(R2-Rl) when R2 <r 1L 
> Rl 

f (ri L ) =0 when r iL >= R2 
30 Where: Rl = n vdw + r L vdw +0.5 
and R2 = Rl + 3 . 0 

and n vdw is the Van der Waal's radius of atom 1 
and r L vdw is the Van der Waal's radius of atom L 
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The term N ro t is the number of rotable bonds of the amino acid side chain and is taken to 
be the number of acyclic sp 3 - sp 3 and sp 3 - sp 2 bonds. Rotations of terminal -CH3 or - 
NH3 are not taken into account. 

The final term, E V d\v, is calculated according to equation 6 below: 
5 E vdw = e 1 e 2 ((r 1 vdw -fr 2 vdw ) 12 /r 12 - (n vdw +r 2 vdw ) 6 /r 6 ) , where: 
8i and e 2 are constants dependant upon atom identity 
n vdw +r 2 vdw are the Van der Waal's atomic radii 
r is the distance between a pair of atoms. 

With regard to Equation 6, in one embodiment, the constants £1 and e 2 are given the atom 

10 values: C: 0.245, N: 0.283, O: 0.316, S: 0.316, respectively (i.e. for atoms of Carbon, 
Nitrogen, Oxygen and Sulphur, respectively). With regards to equations 5 and 6, the Van 
der Waal's radii are given the atom values C: 1.85, N: 1.75, O: 1.60, S: 2.00A. 
It should be understood that all predetermined values and constants given in the equations 
above are determined within the constraints of current understandings of protein ligand 

15 interactions with particular regard to the type of computation being undertaken herein. 
Therefore, it is possible that, as this scoring function is refined further, these values and 
constants may change hence any suitable numerical value which gives the desired results 
in terms of estimating the binding energy of a protein to a ligand may be used and hence 
fall within the scope of the present invention. 

20 As described above, the scoring function is applied to data extracted from the database of 
side-chain conformations, atom identities, and interatomic distances. For the purposes of 
the present description, the number of MHC Class II molecules included in this database 
is 42 models plus four solved structures. It should be apparent from the above 
descriptions that the modular nature of the construction of the computational method of 

25 the present invention means that new models can simply be added and scanned with the 
peptide backbone library and side-chain conformational search function to create 
additional data sets which can be processed by the peptide scoring function as described 
above. This allows for the repertoire of scanned MHC Class II molecules to easily be 
increased, or structures and associated data to be replaced if data are available to create 

30 more accurate models of the existing alleles. 

The present prediction method can be calibrated against a data set comprising a large 
number of peptides whose affinity for various MHC Class II molecules has previously 
been experimentally determined. By comparison of calculated versus experimental data, 
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a cut of value can be determined above which it is known that all experimentally 
determined T-cell epitopes are correctly predicted. 

It should be understood that, although the above scoring function is relatively simple 
compared to some sophisticated methodologies that are available, the calculations are 
5 performed extremely rapidly. It should also be understood that the objective is not to 
calculate the true binding energy per se for each peptide docked in the binding groove of 
a selected MHC Class II protein. The underlying objective is to obtain comparative 
binding energy data as an aid to predicting the location of T-cell epitopes based on the 
primary structure (i.e. amino acid sequence) of a selected protein. A relatively high 
10 binding energy or a binding energy above a selected threshold value would suggest the 
presence of a T-cell epitope in the ligand. The ligand may then be subjected to at least 
one round of amino-acid substitution and the binding energy recalculated. Due to the 
rapid nature of the calculations, these manipulations of the peptide sequence can be 
performed interactively within the program's user interface on cost-effectively available 
15 computer hardware. Major investment in computer hardware is thus not required. 

It would be apparent to one skilled in the art that other available software could be used 
for the same purposes. In particular, more sophisticated software which is capable of 
docking ligands into protein binding-sites may be used in conjunction with energy 
minimization. Examples of docking software are: DOCK (Kuntz et ah, J. MoL BioL, 
20 161:269-288 (1982)), LUDI (Bohm, H.J., /. Comput Aided MoL Des., 8:623-632 (1994)) 
and FLEXX (Rarey M., et aL 9 ISMB, 3:300-308 (1995)). Examples of molecular 
modeling and manipulation software include: AMBER (Tripos) and CHARMm 
(Molecular Simulations Inc.). The use of these computational methods would severely 
limit the throughput of the method of this invention due to the lengths of processing time 
25 required to make the necessary calculations. However, it is feasible that such methods 
could be used as a 'secondary screen' to obtain more accurate calculations of binding 
energy for peptides which are found to be 'positive binders' via the method of the present 
invention. 

The limitation of processing time for sophisticated molecular mechanic or molecular 
30 dynamic calculations is one which is defined both by the design of the software which 
makes these calculations and the current technology limitations of computer hardware. It 
may be anticipated that, in the future, with the writing of more efficient code and the 
continuing increases in speed of computer processors, it may become feasible to make 
such calculations within a more manageable time-frame. 
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Further information on energy functions applied to macromolecules and consideration of 
the various interactions that take place within a folded protein structure can be found in: 
Brooks, B.R., et aL, J. Comput Chem., 4:187-217 (1983) and further information 
concerning general protein-ligand interactions can be found in: Dauber-Osguthorpe et aL, 
5 Proteins4(l):3 1-47(1988), which are incorporated herein by reference in their entirety. 
Useful background information can also be found, for example, in Fasman, G.D., ed., 
Prediction of Protein Structure and the Principles of Protein Conformation, Plenum 
Press, New York, ISBN: 0-306 4313-9. 

10 The following examples describe the invention in more detail. 

EXAMPLE 2 

The 165 amino acid sequence of INFoc2 was analyzed in silico broadly by the method of 
EXAMPLE 1. A panel of 57 13-mer synthetic peptides were produced and analyzed for 
15 their ability to bind in vitro with human MHC class II molecules. The peptide sequences 
are depicted in FIGURE 5. 

MHC class II synthetic peptide binding assays were conducted using human 
lymphoblastoid B cells of known HLA-DR allotype. Cells were fixed with 
paraformaldeyde and incubated with either biotinylated peptides alone or with a non- 
20 biotinylated competitor peptide to determine IC50 values. Following incubation with the 
peptides, cells were lysed and the MHC Class II molecules captured by the anti-HLA-DR 
cc-chain monoclonal antibody LB3.1. Bound biotinylated peptide was detected by 
streptavidin peroxidase, and the amount of bound peptide quantitated by a luminescent 
read out. 


25 
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Competition-binding assays were conducted, where non-biotinylated test peptides were 
incubated with the fixed cells in the presence of the biotinylated competitor peptide. 
Competitor peptides were previously determined to have IC50 values for the particular 
allotypes of interest using a simple (non-competitive) binding assay. The IC 50 value is the 
5 concentration of the unlabeled peptide that prevents 50% of the labelled peptide from 
binding. The concentration of the biotinylated peptide was determined experimentally to 
be at least one sixth of its measured ED 50 (concentration of peptide that gives one half of 
the maximum response) for each allele, to ensure that the inhibition was primarily 
measuring the binding characteristics of the competitor peptide. 

10 

EB V transformed human B lymphoblastoid cell lines are obtainable from EC ACC 
(Salisbury, UK). HOM-2 cells were used in assays for DRB1*0101 binding; WT51 cells 
were used in assays of DRB 1*0401 binding and MOU (MANN) cells were used for 
assays of DRB 1*0701 binding. The mouse hybridoma LB 3.1 was obtained from the 

15 American Tissue Culture Collection ATCC (Virginia, USA). Enhanced 

Chemiluminescent (ECL) reagent was purchased from Amersham Pharmacia (Amersham, 
UK). RPMI 1640 medium, L-glutamine, and penicillin/streptomycin were obtained from 
Life Technologies (Paisley, UK). Optiplates™ were obtained from Packard (Pangbourne, 
England). Biotinylated peptides were obtained from Babraham Tech mx (Cambridge, 

20 England) and non-biotinylated peptides from Pepscan Systems (Lelystad, The 

Netherlands). Prosep A was obtained from Millipore (Watford, UK). DAB, PMSF, 
iodoacetamide, benzamidine, leupeptin, pepstatin, PBS tablets, DMSO, BSA, streptavidin 
peroxidase conjugate and all other chemicals were obtained from The Sigma Chemical 
Company (Poole,UK). 

25 

Lymphoblastoid cells were cultivated in RPMI- 1640 medium plus 10% foetal bovine 
serum (FBS), L-glutamine, and penicillin/streptomycin in a humidified atmosphere at 
37°C/ 5% C0 2 . 
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LB3.1 hybridoma cells were cultivated in RPMI-1640 medium plus 10% foetal bovine 
serum (FBS), L-glutamine, and penicillin/streptomycin in a humidified atmosphere at 
37°C/ 5% C02 and LB3.1 antibody purified from the culture supernatant. The 
supernatant was filtered using (X22jiM filters then 50ml of 1M Tris pH 8.0 was added per 

5 450ml making a O.lM-buffered solution. The buffered supernatant was then passed 
through a 3ml PROSEP A column overnight at 4°C and washed with 25ml of PBS. 
LB3.1 antibody was eluted using 8ml of 0.1M citrate pH 3.0, and each 0.5 ml fraction 
was collected into 500p.l 1M Tris-HCl pH 8.0. The protein content of each fraction was 
determined using a spectrophotometer (A280nm). Fractions were pooled and dialysed 

10 into 800ml PBS, using a Slide-A-Lyser 3.5K cut-off (Pierce). Purity of the LB3.1 was 
checked by reduced SDS-PAGE followed by Coomassie staining. 

Binding assays for each peptide/allotype combination were conducted in triplicate in 96- 
we.H flat bottom Optiplates™ using 2xl0 6 cells per well. Cells were washed twice with 

15 RPMI-1640 then fixed with 0.5% paraformaldehyde/PBS for 30 min on ice. After-2 
washes with RPMI-1640 the cells were incubated with either: biotinylated peptide; 
biotinylated peptide + non-biotinylated competitive peptide or no peptide. Incubation 
was conducted using Peptide Binding Buffer (100 mM Citrate/Phosphate pH 4.5, 5 mM . 
EDTA, 1 mM PMSF, 100 \iM Leupeptin, 1 mM Iodoacetamide, 100 pM Pepstatin A, 1 

20 mM Benzamidine) at 37°C for 24h. Cells were collected by centrifugation, then 80 jxl 
supernatant was removed and replaced with 80 jal/well of NP40 lysis buffer (0.5% NP40, 
150 mMNaCl, 1 mM PMSF, 100 \xM Leupeptin, 1 mM Iodoacetamide, 100 pM 
Pepstatin A, 1 mM Benzamidine 50 mM Tris -HC1 pH to 8.0). Cells were incubated at 
4°C for 45 minutes and a cleared lysate obtained by centrifugation. 50 \i\ 

25 (triplicate/sample) of lysate was added to each well of a pre-coated 96 well assay plate 
containing 50 \x\ PBS 15% BSA. In all experiments pre-coating had been conducted the 
night before where assay plates had been pre-treated at 4°C with 100 (il/well anti-class II 
antibody LB3.1 diluted to 20 |Ltg/ml in PBS. Excess antibody was removed and the plate 
blocked with 250 pil/well of PBS/5% BSA for 2 hours at room temperature. Each plate 

30 - was washed 7x with PBS/1% Tween and 50 \x\ of PBS/5% BSA/0.5% NP40 added to 
each well before addition of the cell lysates. 
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Following addition of the lysates, plates were incubated at 4°C for 2 hours, then washed 
7x with PBS/0.1% Tween. 100 pJ of Streptavidin Peroxidase diluted 1:1000 in PBS/5% 
BSA/ 0.1% Tween was added to each well and the plates incubated at 4°C for 1 hour. 
Plates were washed 7x with PBS/0.1% Tween and 100 \i\ of chemiluminescent substrate 
5 (Amersham Pharmacia) added to each well. Plates were read using a Perkin Elmer 
MicroBeta® TriLux plate reader and results were given in CPS. 

In the competition analyses: maximum binding of biotinylated peptide was defined as the 
binding (CPS) occurring in the absence of the competitor peptide, and inhibition by the 
10 formula: 


% Inhibition = 100 x fYCPS with no competitor) - (CPS with competitor)! 

[CPS with no competitor] 

15 The concentration of competitor peptide causing 50% inhibition of maximun biotinylated 
peptide binding was taken as the ICso- 

The binding assays conducted on the panel of 13-mer peptides as listed in FIGURE 5 are 
depicted in FIGURE 6a-d. With the exception of peptides shown in FIGURE 6d, all 
20 peptides indicate a binding interaction with one or more of the human MHC class II 
allotypes tested. 

EXAMPLE 3 

The interaction between MHC, peptide and T-cell receptor (TCR) provides the structural 
25 basis for the antigen specificity of T-cell recognition. T-cell proliferation assays test the 

binding of peptides to MHC and the recognition of MHC/peptide complexes by the TCR. 

In vitro T-cell proliferation assays of the present example, involve the stimulation of 

peripheral blood mononuclear cells (PBMCs), containing antigen presenting cells (APCs) 

and T-cells. Stimulation is conducted in vitro using synthetic peptide antigens, and in 
30 some experiments whole protein antigen. Stimulated T-cell proliferation is measured 

using 3 H-thymidine ( 3 H-Thy) and the presence of incorporated 3 H-Thy assessed using 

scintillation counting of washed fixed cells. 
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Buffy coats from human blood stored for less than 12 hours were obtained from the 
National Blood Service (Addenbrooks Hospital, Cambridge, UK). Ficoll-paque was 
obtained from Amersham Pharmacia Biotech (Amersham, UK). Serum free AIM V 
media for the culture of primary human lymphocytes and containing L-glutamine, 
5 50jxg/ml streptomycin, 10|xg/ml gentomycin and 0.1% human serum albumin was from 
Gibco-BRL (Paisley, UK). Synthetic peptides were obtained from Eurosequence 
(Groningen, The Netherlands) and Babraham Technix (Cambridge, UK). 

Erythrocytes and leukocytes were separated from plasma and platelets by gentle 

10 centrifugation of buffy coats. The top phase (containing plasma and platelets) was 
removed and discarded. Erythrocytes and leukocytes were diluted 1:1 in phosphate 
buffered saline (PBS) before layering onto 15ml ficoll-paque (Amersham Pharmacia, 
Amersham UK). Centrifugation was done according to the manufacturers recommended 
conditions PBMCs were harvested from the serum+PBS/ficoll paque interface. PBMCs 

15 were mixed with PBS (1:1) and collected by centrifugation. The supernatant was 

removed and discarded and the PBMC pellet resuspended in 50ml PBS. Cells were again 
pelleted by centrifugation and the PBS supernatant discarded. Cells were resuspended 
using 50ml AIM V media and at this point counted and viability assessed using trypan 
blue dye exclusion. Cells were again collected by centrifugation and the supernatant 

20 discarded. Cells were resuspended for cryogenic storage at a density of 3xl0 7 per ml. 

The storage medium was 90%(v/v) heat inactivated AB human serum (Sigma, Poole, UK) 
and 10%(v/v) DMSO (Sigma, Poole, UK). Cells were transferred to a regulated freezing 
container (Sigma) and placed at -70°C overnight. When required for use, cells were 
thawed rapidly in a water bath at 37°C before transferring to 10ml pre- warmed AIM V 

25 medium. 

PBMC were stimulated with protein and peptide antigens in a 96 well flat bottom plate at 
a density of 2xl0 5 PBMC per well. PBMC were incubated for 7 days at 37°C before 
pulsing with 3 H-Thy (Amersham-Phamacia, Amersham, UK). For the present study, 
30 synthetic peptides (15mers) that overlapped by 3aa increments were generated that 
spanned the entire sequence of IFNa. Peptide identification numbers (ID#) and 
sequences are given in FIGURE 7. Each peptide was screened individually against 
PBMCs isolated from 20 naive donors. Two control peptides that have previously been 
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shown to be immunogenic and a potent non-recall antigen KLH were used in each donor 
assay. 

The control antigens used in this study were as below: 


Peptide 

Sequence 

C-32 

Biotin-PKYVKQNTLKLAT 
Flu haemagglutinin 307-319 

C-49 

KWDQIKKISKPVQH 
Chlamydia HSP 60 peptide 

KLH 

Whole protein from Keyhole 
Limpet Hemocyanin. 


5 

Peptides were dissolved in DMSO to a final concentration of lOmM, these stock solutions 
were then diluted 1/500 in AIM V media (final concentration 20|iM). Peptides were 
added to a flat bottom 96 well plate to give a final concentration of 2 and 20|LiM in a 
100|il. The viability of thawed PBMC's was assessed by trypan blue.dye exclusion, cells 

10 were then resuspended at a density of 2xl0 6 cells/ml, and 100|xl (2xl0 5 PBMC/well) was 
transferred to each well containing peptides. Triplicate well cultures were assayed at each 
peptide concentration. Plates were incubated for 7 days in a humidified atmosphere of 
5% C0 2 at 37°C. Cells were pulsed for 18-21 hours with l^Ci 3 H-Thy/well before 
harvesting onto filter mats. CPM values were determined using a Wallac microplate beta 

15 top plate counter (Perkin Elmer). Results were expressed as stimulation indices, 
determined using the following formula: 

Proliferation to test peptide CPM 
Proliferation in untreated wells CPM 
A stimulation index of 2 or greater was taken as positive stimulation in this assay. 

20 Mapping T cell epitopes in the IFNa sequence using the T cell proliferation assay 
resulted in the identification of three immunogenic regions Rl, R2, R3. This was 
determined by T cell proliferation in 6 donors that responded to peptides in one or more 
of these regions. Region 3 is considered to contain a potential immunodominant T-cell 
epitope as proliferation is scored in 4 of 6 donors that responded to IFNa peptides. 

25 Regions 1 and 2 induce T-cell proliferation in certain individuals. The cumulative 
response data for the responding individuals is depicted in FIGURE 8, and data from 
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individual responders is summarized in FIGURE 9. The stimulation index for individual 
donors is shown in FIGURE 15. The epitope data for INFa and indicating Rl, R2 and R3 
together with the individual peptide/donor responses is depicted in FIGURE 10. 

5 EXAMPLE 4 

The tissue types for all PBMC samples used in EXAMPLE 3 were assayed using a 
commercially available reagent system (Dynal, Wirral, UK). Assays were conducted in 
accordance with the suppliers recommended protocols and standard ancillary reagents and 
agarose electrophoresis systems. Allotypic coverage for DRB1 alleles was 70% in the 20 

10 donors tested. Results of the tissue typing were used to assess the frequency of INFcx2 
peptide responders carrying specific MHC class II alleles. Allotypic restriction of a given 
peptide is determined by the frequency of an allele in the donor population and the 
number of responding donors that express the same allele. If a peptide is associated with 
any particular allele then the frequency (expressed as a percentage) is expected to be 

15 greater than the frequency of the allele in the population. Results of such an analysis is 
given in FIGURE 11. In general the small numbers preclude rigorous statistical 
examination, however the data indicate a possible association of peptides from the epitope 
region defined as R3 with the DRB4*01 allotype 

20 EXAMPLE 5 

MHC Peptide binding assays were conducted using synthetic peptides containing 
sequences derived from the major immunogenic regions identified using the biological 
assay of EXAMPLE 3. In these assays synthetic 15-mer peptides were tested for their 
ability to bind three MHC allotypes in competition with a biotinylated competitor peptide. 
25 Assays were conducted broadly as detailed in EXAMPLE 2 and IC 5 o values calculated 
from binding curves derived from six concentration ratios of competitor to test peptide. 
The IC 50 values for each peptide / allotype combination tested are shown in FIGURE 12a- 
12c. These data indicate that peptides capable of stimulating T-cell proliferation in an in 
vitro biological assay may be of low or high affinity MHC class II ligands. 


EXAMPLE 6 

A number of modified IFNa2 molecules were made using conventional recombinant 
DNA techniques. A wild-type INFoc2b gene was cloned from human placental DNA and 
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the gene was used both as a control reagent, and a template from which to derive 
modified INFa2b genes by site-directed mutagenesis. Wild-type and modified genes 
were inserted into a eukaryotic expression vector and the recombinant INFoc2 proteins 
expressed as fusion protein with the human immunoglobulin constant region domain. 
5 Recombinant proteins were prepared from transiently transfected human embryonic 
kidney cells and assayed as detailed in EXAMPLE 7 

Briefly, the wild-type INFa2b gene was amplified from human placental DNA (Sigma, 
Poole, UK) using the polymerase chain reaction (PCR). The gene contains no introns and 
was readily amplified using forward and reverse primers OL177 and OL178 containing 
10 restriction sites to facilitate cloning as given below: 

OL177(EcoRI) 

5 ' CCGGAATTCGCTAGCTGCCCAGCCGGCGATGGCCTGTGATCTGCCTCAAACCCACAGCC-3 ' 

15 OL178 (XhoI/BamHI) 

5 ' -CCGGGATCCCTCGAGCTATTATTCCTTACTTCTTAAACTTTCTTGCAAG-3 ' 

The PCR product of 550 bp was digested with EcoRI and BamHI and cloned into the 
pLITMUS28 vector (NEB, UK Ltd.). The sequence was confirmed to be that of interferon 

20 alpha 2b by analysis of a number of positive clones. Li order to obtain expression from 
human embryonic kidney cells, the wild-type gene was re-cloned into vector pd-Cs [Lo, 
et al (1998), Protein Engineering 11: 495]. The pd-Cs vector directs the expression of a 
fusion protein containing the human immunoglobulin constant region domain. Cloning to 
this vector was achieved using PCR and primers OL232 and OL178. These primers 

25 provide cloning sites for use with enzymes Xmal and BamHI as below: 

OL232 (Xmal) 

5 ' CTGTCCCCGGGTAAATGTGATCTGCCTCAGACCCACAGCC 3 ' 

30 OL178 (XhoI/BamHI) 

5 ' -CCGGGATCCCTCGAGCTATTATTCCTTACTTCTTAAACTTTCTTGCAAG-3 ' 

The PCR product of 530 bp was digested with Xmal and BamHI, purified using a Qiagen 
gel extraction kit and transferred into prepared pd-Cs from which the IFN(L) sequence 
35 had been removed using Xmal and BamHI. A positive clone was selected and the 

INFa2b sequence confirmed by sequence analysis. The pd-Cs vector containing the wild- 
type INFoc2b gene was termed pCEFN5. 
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Single or multiple codon mutations to generate modified INFcc2 genes is conducted by 
mutagenic PCR using pCEFNS as a template. Overlap PCR was used to combine the two 
mutated halves of the interferon sequence. This fragment is then cloned into an 
intermediate vector (pGEM-T EASY vector; Promega, UK) for sequence analysis prior to 
5 being transferred into the pd-Cs derived expression vector using Xmal and BamHI as 
described above. 

Mutagenesis was conducted using flanking primers OL235 and OL234 in separate 
reactions in combination with specific mutagenic (mis-matched) primers and the pCIFN5 
template DNA. 

10 

OL234: 5 1 -CTCATGCTCCGTGATGCATGAGGC 
OL23 5 : 5 9 -C ACTGC ATTCTAGTTGTGGTTTGTC 

Reactions were conducted using Expand HI Fidelity PCR reagents (Roche,GmbH) and 
15 reaction conditions specified by the following cycle: 

94°C/2' + 25 Cycles @ 94°C/30", 60°C/30'\ 72°C/30" + 72°C/10' 

The products of the separate reactions were joined by PCR in a reaction driven by primers 
20 OL235 and OL234 using 15 cycles of PCR as above. 

PCR products were gel purified using commercially available kit systems (Qiagen gel 
extraction kit). The products were cloned using a T/A cloning system into vector pGEM- 
T EASY (Promega, UK) and a number of clones were sequenced in each case to confirm 
25 the successful introduction of the desired mutation. 

The desired clones were digested with BamHI and Xmal and the purified product ligated 
into a prepared pd-Cs vector. Cloning was conducted using E.coli XLl-Blue cells 
(Strategene Europe) and culture conditions recommended by the supplier. Sequence 
30 confirmation was conducted on all final vector preparations using OL261 and OL234 as 
sequencing primers. 

OL 261 5 ' - GGTG AC AG AG AC TC C C C TG ATG AAG 3 ' 
35 OL234: 5 ' -CTCATGCTCCGTGATGCATGAGGC 3 ' 
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Expression of modified INFa2 human IgFC fusion proteins was achieved using HEK293 
human embryonic kidney cell line as the expression host. All DNA for transfection was 
prepared using the high purity CONCERT midiprep system and instructions provided by 
the supplier (Invitrogen, Paisley, UK). DNA is filter sterilised prior to use and quantified 
5 by measurment of the A260. Concentrations were adjusted to 0.5-1.0 \Lg/\i\. 

For transient expression, HEK293 were grown using D-MEM glutamax medium 
(Invitrogen, Paisley, UK) supplemented with 10% PCS and 250 ^Lg/ml geneticin. Prior to 
transfection, cells were collected by treatment with trypsin and washed using PBS. After 
10 2 cycles of washing cells are taken into fresh medium at a density of 4 x 10 5 cells/ml, and 
plated into multiwell dishes pre-treated with poly-l-lysine to ensure good cell adhesion. 
Typically, 2 x 10 s cells are added to each well of a 48 well plate and the plates incubated 
overnight at 37°C/5%C0 2< 

15 Prior to transfection, the medium is replaced in each well and the transfection mixes 
added. Transfection is conducted using the lipofectamine reagent and instructions 
provided by the supplier (Invitrogen, Paisley, UK). Briefly, transfection mixes are 
prepared containing lipofectamine, OPTI-MEM (Invitrogen, Paisley, UK) and 0.8 jag 
DNA per well for each expression vector construct. Transfection mixes are added to the 

20 cells and the cells incubated for 4-6 hours. The medium is replaced with 0.5 ml fresh 
media and the cells incubated at 37°C/5%C02. Samples were taken after 48 hours for 
analysis by both anti-FC ELISA and Daudi cell proliferation assay. The media was 
harvested after 7 days and stored at 4C for further analysis as above. 

25 The medium is assayed for the presence of INFa2 using a commercially available ELISA 
system and instructions provided by the supplier (R& D systems, UK). In some instances 
an ELISA detecting the human immunglobulin constant region domain of the IFNoc- 
fusion protein was applied. For this assay a mouse anti-human IgG Fc preparation 
(Sigma, Poole, UK) is used as a capture reagent. The INFa-HuFc fusion is quantitated 

30 with reference to a standard curve generated using a dilution series of a reference human 
IgG preparation (Sigma). Bound ESTFa-FC fusion or the reference protein is detected 
using an anti-human IgG peroxidase conjugate (Sigma) and Sigma OPD colourimetric 
substrate. 
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Following estimation of the amount of INFot in the HEK293 conditioned medium, the 
conditioned medium is used directly to test the functional activity of the modified INFcc 
using the anti-proliferation assay as detailed in EXAMPLE 7. 

5 EXAMPLE 7 

Modified interferon molecules of the present invention were tested for their ability to 
inhibit the growth of human B cell lymphoma line Daudi. The method is broadly as 
described previously [Mark, D.F. et al (1984) Proc. Natl Acad. Set USA 81 : 5662-5666] 
and involves incubation of Daudi cells with the test interferon. The antiproliferative 
10 effect of the test molecule is measured using a soluble dye substance which undergoes a 
colour change in the presence of proliferating cells. The induced colour change is 
measured in a spectrophotometer and any antiproliferative effect is computed with 
reference to the colour change recorded in non-treated control cells and cells treated with 
a standard interferon preparation. 

15 

Briefly, Daudi cells (ATCC # CCL-213) were cultured RPMI 1640 Media supplemented 
with 100 units/ml Penicillin/ 100 ug /ml Streptomycin and 2 mM L-Glutamine and 20% 
Fetal Bovine Serum (FBS). All media and supplements were from Gibco (Paisley, UK). 
The day before assay, cells are replaced into fresh medium at a density 0.9xl0 6 /ml and 
20 next day replaced into fresh medium as above except containing 10%(v/v) FBS. The cell 
density is adjusted to be 2x 10 5 cells/ml. 

The test and control interferon preparations are diluted into RPMI containing 10% FBS. 
Dilutions are made into 96- well flat bottom plates to contain 100ul/ well and all samples 
are set up in triplicate. Typically doubling dilution series are set out across each plate. 

25 Positive control wells are also included in triplicate with a starting concentration of the 
interferon standard (NIBSC, South Minims, UK) at 10000 pg/ml. Control wells 
containing lOOul media alone (no interferon) are also included. lOOul of the cells are 
added to each well, and the plates incubated for 72 hours at 37°C, 5% C0 2 . 
Proliferation is assessed using Aqueous One reagent system and the suppliers 

30 recommended protocol (Promega, Southampton, UK). Briefly, 40 \il of the Aqueous One 
reagent is added to all wells and the substrate mixed. Plates are incubated at 37°C for 
one hour, and then transferred to the plate reading instrument for determination of the 
light absorbance. Readings are taken at 490nm. Average absorbance at 490 nm is plotted 
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on the Y axis versus concentrations of interferon standard added along the X axis. 
Interferon concentration is determined using a ELIS A techniques as detailed in 
EXAMPLE 6. The resulting calibration curve is used to determine the ED50 value for 
each test sample. 

5 Results of such an analysis according to the above method for a number of modified 
INFoc2 molecules are depicted in FIGURE 14. The results indicate retained anti- 
proliferative properties in the presence of amino acid substitutions within the INFa2 
sequence. 
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Patent Claims 


L A modified molecule having the biological activity of human interferon alpha 2 
(INFcc2) and being substantially non-immunogenic or less immunogenic than any 
5 non-modified molecule having the same biological activity when used in vivo. 

2. A molecule according to claim 1, wherein said loss of immunogenicity is achieved 
by removing one or more T-cell epitopes derived from the originally non-modified 
molecule and / or by reduction in numbers of MHC allotypes able to bind peptides 

10 derived from said molecule.. 

3. A molecule according to claim 2, wherein one T-cell epitope is removed. 

4. A molecule according to any of the claims 2-4, wherein said originally present T- 
15 cell epitopes are MHC class II ligands or peptide sequences which show the ability 

to stimulate or bind T-cells via presentation on MHC class II. 

5. A molecule according to claim 4, wherein said ligands or peptide sequences are 
13mer or 15mer peptides. 

20 

6. A molecule according to claim 5, wherein said peptide sequences are selected from 
the group as depicted in Figure 1. 

7. A molecule according to any of the claims 2-6, wherein 1-9 amino acid residues 
25 in any of the originally present T-cell epitopes are altered. 

8. A molecule according to claim 7, wherein one amino acid residue is altered. 

9. A molecule according to claim 7 or 8, wherein the alteration of the amino acid 

30 residues is substitution of originally present amino acid(s) residue(s) by other amino 

acid residue(s) at specific position(s). 


i 
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10. A molecule according to claim 9, wherein one or more of the amino acid residue 
substitutions are carried out as indicated in Figure 2. 

11. A molecule according to claim 10, wherein additionally one or more of the amino 

5 acid residue substitutions are carried out as indicated in Figure 3 for the reduction in 

the number of MHC allotypes able to bind peptides derived from said molecule. 

12. A molecule according to claim 9, wherein one or more amino acid substitutions are 
carried as indicated in Figure 3. 

10 

13. A molecule according to claim 7 or 8, wherein the alteration of the amino acid 
residues is deletion of originally present amino acid(s) residue(s) at specific 
position(s). 

15 14. A molecule according to claim 7 or 8, wherein the alteration of the amino acid 
residues is addition of amino acid(s) at specific position(s) to those originally 
present. 

15. * A molecule according to any of the claims 7 to 14, wherein additionally further 
20 alteration is conducted to restore biological activity of said molecule. 

16. A molecule according to claim 15, wherein the additional further alteration is 
substitution, addition or deletion of specific amino acid(s). 

25 17. A modified molecule according to any of the claims 7 - 16, wherein the amino acid 
alteration is made with reference to an homologous protein sequence. 

18. A modified molecule according to any of the claims 7 - 16, wherein the amino acid 
alteration is made with reference to in silico modeling techniques. 

30 

19. A modified molecule according to any of the claims 7-16, wherein the amino acid 
alteration is made with reference to stimulation or binding of T cells by INFoc2 
derived peptides or INFa2 protein. 
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20. A modified molecule having the biological activity of human interferon alpha 2 
(INFcc2) and being substantially non-immunogenic or less immunogenic than any 
non-modified molecule having the same biological activity when used in vivo, 
obtainable by alteration of one or more amino acids in the primary sequence by (i) 

5 removing one or more T-cell epitopes derived from the originally non-modified 

molecule and being MHC class II ligands or peptide sequences which show the 
ability to stimulate or bind T-cells via presentation on MHC class II, and / or (ii) by 
reduction in numbers of MHC allotypes able to bind peptides derived from said 
molecule, 

10 wherein said modified molecule comprises alterations which are made at one or 

more positions within following strings of contiguous amino acid residues of said 
primary sequence derived from the HSfFa2 wild-type: 

(a) ISLFSCLKDRHDFGFPQEEFGNQFQKAETIPVLH (Rl), 

(b) FNLFSTKDSSAAWDE (R2), 

15 (c) KEDSILAVRKYFQRITLY (R3), 

21. A molecule according to claim 20, wherein said alteration is substitution of 1 - 9 
amino acid residues. 

20 22. A molecule according to claim 21, wherein said substitution is conducted at one or 
more amino acid residues from the strings Rl, R2 and R3. 

23. A molecule according to claim 21, wherein said substitution is conducted at one or 
more amino acid residues from the strings R2 and R3. 

25 

24. A molecule according to claim 21, wherein said substitution is conducted at one or 
more amino acid residues from the string R3. 

25. A molecule according to any of the claims 19 - 24, wherein additionally one or 

30 more substitutions of amino acid residues outside the sequence strings Rl, R2 or R3 

are conducted. - - - 
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26. A molecule according to any of the claims 19-25 comprising an amino acid 
residue substitution made at one or more positions in the wild-type molecule: 

24, 26, 27, 38, 55, 63, 64, 66, 67, 76, 84, 85, 89, 103, 110, 111, 116, 117, 119, 122, 
123, 126, 128, 129, 130, 153. 

5 

27. A molecule according to claim 25 comprising an amino acid residue substitution 
made at one or more positions in the wild-type molecule: 

26, 27, 38, 63, 85, 89, 103, 110, 111, 116, 117, 122, 123, 126, 128, 153 

10 28. A molecule according to claim 27, wherein said substitution is made at one or more 
positions selected from 26, 27, 38. 


29. A molecule according to claim 27, wherein said substitution is made at position 63, 
85 or 89. 

15 

30. A molecule according to claim 27, wherein said substitution is made at one or more 
positions selected from 103, 110, 111, 116, 117, 122, 123, 126, 128, 153. 

31. A molecule according to claim 27, wherein said substitution is made at one or more 
20 positions selected from 26, 27, 38, and additionally at position 63, 85 or 89. 

32. A molecule according to claim 27, wherein said substitution is made at one or more 
positions selected from 26, 27, 38 or alternatively from 63, 85, 89 and additionally 
atone or more positions selected from 103, 110, 111, 116, 117, 122, 123, 126, 128, 

25 153. 


33. A molecule according to claim 26, wherein said substitution is made at one or more 
positions as specified in Figure 4. 

30 34. A molecule according to claim 28, wherein said substitution is selected from 
L26P, F27S, F38E. 

35. A molecule according to claim 29, wherein said substitution is selected from 
I63T, Y85S, Y89D, Y89E, Y89N. 
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36. A molecule according to claim 30, wherein said substitution is selected from 

V103E, L110G, Mil IT, Mil IS, Ml HE, I116S, I116Q, L117G, L117A, Y122E, 
Y122Q, F123H, I126A, L128A, L153S. 

5 37. A modified molecule having the biological activity of human interferon alpha 2 
(INFot2) and being substantially non-immunogenic or less immunogenic than any 
non-modified molecule having the same biological activity when used in vivo, 
obtainable by substitution of one or more amino acids in the primary sequence by 
(i) removing one or more T-cell epitopes derived from the originally non-modified 

10 molecule and being MHC class II ligands or peptide sequences which show the 

ability to stimulate or bind T-cells via presentation on MHC class II, and / or (ii) by 
reduction in numbers of MHC allotypes able to bind peptides derived from said 
molecule, wherein said substitution is made at one or more positions in a wild-type 
molecule INFa2a or INFa2b corresponding to at least one of the groups selected 

15 from: 

(i) I24P, L26P, F27S, F38E, V55A, 

(ii) I63T, L66A, F67D, F67E, W76H, F84D, F84E, Y85S, Y89D, Y89E, Y89N, 

(iii) any position within sequence R3, including L153S 

20 38. A molecule of claim 37, whereby one or more of the following substitutions are 
made within sequence R3: 

V103E, LI 10G, LI 10S, Ml 1 IT, Ml 1 IS, Ml 1 IE, II 16S, II 16Q, LI 17G, LI 17A, 
V119A, Y122Q, Y122E, Y122H, F123H, I126A, L128A, Y129N, L130G, L130, 
L153S 


25 


39. A molecule of claim 38, whereby one or more of the following substitutions are 
made: 

Y122E, Y122Q, F123H, I126A, L128A. 


30 40. 


A molecule according to any of the claims 36 
substitutions are made. 


- 39, whereby additionally 
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41. A modified human interferon alpha 2 (INFa2) having reduced immunogenicity 

consisting of the following sequence: 
CDLPQTHSLGSRRTLMLLAQMRX°ISLFSCL 
QQIFNLFSTKDSSAAWDETLLDKFYTELYQ 
5 X r X 2 QIOC 3 TXVLKEKKYSPCAWEVVRAEIMRSFSLSTNLQESLRSKE , 

wherein X° is R, K; 

X 1 is Y, E, Q; 

X 2 is F, H; 

X 3 is I, A; and 
10 X 4 is L, A; 

whereby simultaneously X 1 = Y, X 2 = F, X 3 = I and X 4 = L are excluded. 

42. A modified human interferon alpha 2 (ESTFoc2) having reduced immunogenicity 

consisting of the following sequence: 
15 CDLPQTHSLGSRRTLMLLAQMRX°ISLFSCLKDRHDFGFPQEEFGNQFQKAETIPVLHEMI 

qqifnlfstkdssaawdetlldkfytelyqqlndleacviqgvgvtetpx 1 x 2 kedsx 3 x*av 
rkx s x 6 qrx 7 txVlkekkyspcawevvraeimrsfsx 9 stnlqeslrske, 

wherein X° is R, K; 

X 1 is L, S, G, 

20 X 2 isM,T, S,E, 

X 3 is I, S, Q, 

X 4 is L, G, 

X 5 is Y, E, Q; 

X 6 is F, H; 

25 X 7 is I, A; 

X 8 is L, A; and, 
X 9 is L, S 

whereby simultaneously X 1 = L, X 2 = M, X 3 = I, X 4 = L, X 5 = Y, X 6 = F, X 7 = I, 
X 8 = L and X 9 = L are excluded. 

30 

43. A modified human interferon alpha 2 (ESfFa2) having reduced immunogenicity 
consisting of the following sequence: 

CDLPQTHSLGSRRTLMLLAQMRX°ISLFSCLKDRHDFGFPQEEFGNQFQKAETIPVLHEMI 
QQX l X a NX 3 X 4 STKDS S AAX S DETLLDKX 6 X 7 TELX 8 QQLNDLE ACVIQGVGVTETPLMKEDS IL 
35 AVRKYFQRITLYLKEKKYSPCAWEWRAEIMRSFSLSTNLQESLRSKE , 
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wherein X° is R, K; 
X 1 is I,T; 
X 2 is F, D, A; 
X 3 is L,A; 
5 X 4 is F,D,E; 

X 5 is W,H; 
X 6 is F, D, E; 
X 7 is Y, S and 
X 8 is Y, D, E, N; 

10 whereby simultaneously X 1 = I, X 2 = F, X 3 = L, X 4 = F, X 5 = W, X 6 = F, X 7 = Y 

and X 8 = Y are excluded. 


44. A modified human interferon alpha 2 (INFa2) having reduced immunogenicity 
consisting of the following sequence: 
15 CDLPQTHSLGSRRTLMLLAQMRX°ISLFSCLKDRHDFGFPQEEFGNQFQKAETIPVLHEMI 
QQX 1 FNLFSTKDSSAAWDETLLDKFX 2 TELX 3 QQLZSTDLEACVIQGVGVTETPLMKEDSILAV 
RKYFQRITLYLKEKKYSPCAWEVVRAEIMRSFSLSTNLQESLRSKE, 

wherein X° is R, K; 

X 1 is I,T; 
20 X 2 is Y, S and 

X 3 is Y, D, E, N; , 
whereby simultaneously X 1 = I, X 2 = Y and X 3 = Y are excluded. 


45. A modified human interferon alpha 2 (INFot2) having reduced immunogenicity 
25 consisting of the following sequence: 

CDLPQTHSLGSRRTLMLLAQMRX°ISX 1 X 2 SCLKDRHDFGX 3 PQEEFGNQFQKAETIPX 4 LHE 
f MIQQIFNLFSTKDSSAAWDETLLDKFYTELYQQLNDLEACVIQGVGVTETPLMKEDSILAV 
RKYFQRITLYLKEKKYSPCAWEVVRAEIMRSFSLSTNLQESLRSKE , 

wherein X° is R, K; 
30 X 1 is; L,P, 

X 2 is; F,S, 
X 3 is F, E and 
X 4 is; V, A 

whereby simultaneously X 1 = L, X 2 = F, X 3 = F and X 4 = V are excluded. 
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46. A modified INFa2 sequence according to claim 41, wherein additional substitutions 
are made. 

5 47. A modified INFa2 sequence according to claim 46 containing in addition 
substitutions according to claim 43. 

48. A modified INFa2 sequence according to claim 46 containing in addition 
substitutions according to claim 44. 

10 

49. A modified INFoc2 sequence according to claim 47 or 48 containing in addition 
substitutions according to claim 45. 

50. A modified INFoc2 sequence according to claim 42 containing additional 
15 substitutions. 

51. A modified INFa2 sequence according to claim 50 containing in addition 
substitutions according to claim 43. 

20 52. A modified INFa2 sequence according to claim 50 containing in addition 
substitutions according to claim 44. 

53. A modified INFa2 sequence according to claim 51 or 52 containing in addition 
substitutions according to claim 45. 

25 

54. A modified HMFa2 sequence according to claim 1, wherein additional substitutions 
are made at one ore more positions within partial sequence Rl and / or R2 and /or 
R3, wherein Rl, R2, R3 are defined as specified in claim 19. 

30 55. A DNA sequence coding for a modified INFcc2 of any of the claims 1 - 54. 
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56. A pharmaceutical composition comprising a modified molecule having the 

biological activity of INFa2 as defined in any of the above-cited claims, optionally 
together with a pharmaceutically acceptable carrier, diluent or excipient. 


5 57. A method for manufacturing a modified molecule having the biological activity of 
INFa2 as defined in any of the claims of the above-cited claims comprising the 
following steps: 

(i) determining the amino acid sequence of the polypeptide or part thereof. 

(ii) identifying one or more potential T-cell epitopes within the amino acid sequence 
10 of the protein by any method including determination of the binding of the peptides 

to MHC molecules using in vitro or in silico techniques or biological assays; 

(iii) designing new sequence variants with one or more amino acids within the 
identified potential T-cell epitopes modified in such a way to substantially reduce or 
eliminate the activity of the T-cell epitope as determined by the binding of the 

15 peptides to MHC molecules using in vitro or in silico techniques or biological 

assays ,or by binding of peptide-MHC complexes to T-cells. 

(iv) constructing such sequence variants by recombinant DNA techniques and 
testing said variants in order to identify one or more variants with desirable 
properties; and 

20 (v) optionally repeating steps (ii) - (iv). 


58. A method of claim 57, wherein step (iii) is carried out by substitution, addition or 
deletion of 1 - 9 amino acid residues in any of the originally present T-cell epitopes. 

25 59. A method of claim 57, wherein the alteration is made with reference to an 
homologous protein sequence and / or in silico modeling techniques. 


60. A method of any of the claims 57 - 59, wherein step (ii) is carried out by the 

following steps: (a) selecting a region of the peptide having a known amino acid 
30 residue sequence; (b) sequentially sampling overlapping amino acid residue 

segments of predetermined uniform size and constituted by at least three amino acid 
residues from the selected region; (c) calculating MHC Class II molecule binding 
score for each said sampled segment by summing assigned values for each 


WO 02/085941 


- 55 - 


PCT/EP02/02218 


hydrophobic amino acid residue side chain present in said sampled amino acid 
residue segment; and (d) identifying at least one of said segments suitable for 
modification, based on the calculated MHC Class II molecule binding score for that 
segment, to change overall MHC Class II binding score for the peptide without 
5 substantially the reducing therapeutic utility of the peptide. 

61. A method of claim 60, wherein step (c) is carried out by using a Bohm scoring 
function modified to include 12-6 van der Waal's ligand-protein energy repulsive 
term and ligand conformational energy term by (1) providing a first data base of 

10 MHC Class II molecule models; (2) providing a second data base of allowed 

peptide backbones for said MHC Class II molecule models; (3) selecting a model 
from said first data base; (4) selecting an allowed peptide backbone from said 
second data base; (5) identifying amino acid residue side chains present in each 
sampled segment; (6) determining the binding affinity value for all side chains 

15 present in each sampled segment; and repeating steps (1) through (5) for each said 

model and each said backbone. 


62. A peptide molecule consisting of 13 consecutive amino acid residues having a 
potential MHC class II binding activity and created from the primary sequence of 

20 non-modified USfFa2 , selected from the group as depicted in Figure 1, Figure 6a, 

6b, 6c. 

63. A peptide molecule consisting of 15 consecutive amino acid residues having a 
potential MHC class II binding activity and created from the primary sequence of 

25 non-modified INFa2, selected from any of the groups of partial sequences Rl, R2, 

R3 as specified in claim 19. 


64. A peptide molecule according to claim 63 selected from Figure 7. 


30 65. A peptide sequence consisting of at least 9 consecutive amino acid residues selected 
from the T-cell epitope peptides as specified in claims 62 - 64. 
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66. A peptide molecule consisting of 9 - 15 consecutive amino acid residues, having a 
potential MHC class II binding activity and created from the primary sequence of 
non-modified INFa2, whereby said molecule has a stimulation index of at least 1.8 
in a biological assay of cellular proliferation wherein said index is taken as the 
5 value of cellular proliferation scored following stimulation by a peptide and divided 

by the value of cellular proliferation scored in control cells not in receipt peptide 
and wherein cellular proliferation is measured by any suitable means according to 
standard methods. 


10 67. A peptide molecule according to claim 66 consisting of a sequence as indicated in 
Figure 6 or 7, 

68. A peptide molecule according to claim 66 comprising at least 9 consecutive amino 
acid residues from any of the INFa2 partial sequences Rl, R2, R3 as defined in 

4 

15 claim 19. 


69. Use of a peptide according to any of the claims 62 - 68, for the manufacture of 
modified INFoc2 molecules having substantially no or less immunogenicity than any 
non-modified molecule with the same or acceptably reduced degree of biological 

20 activity when used in vivo, 

70. A pharmaceutical composition consisting of a synthetic peptide sequence as 
specified in any of the claims 62-68, optionally together with a pharmaceutically 
acceptable carrier, diluent or excipient. 


25 
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FIGURE 1 


Peptide sequences in human INFoQb with potential human MHC class II binding activity. 
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FIGURE 2 

Substitutions leading to the elimination ofT-cell epitopes of human INFo2 
(WT = wild type residue). 
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FIGURE 3 

Additional substitutions leading to the removal of a potential T-cell epitope for 1 or more 
MHC allotypes. 


residue 
# 

WT 








Substitutions 



64 

F 

W 

Y 









65 

N 

A 

C 

D 

E 

F 

G 

H 

I K L M 

p 

Q R S T V W Y 

66 

L 

M 

V 

W 

Y 







67 

F 

M 

V 

W 

Y 







68 

S 

A 

F 

G 

H 

I 

M 

P 

T V W Y 



70 

K 

A 

C 

H 

P 

T 






71 

D 

F 

H 

I 

L 

P 

T 

V 

W Y 



72 

S 

D 

H 

P 

T 

W 

Y 





73 

S 

A 

C 

G 

P 

T 






74 

A 

D 

P 

Q 

R 

S 

T 





75 

A 

C 

D 

E 

G 

H 

K 

N 

P Q R S 

T 

w 

76 

W 

M 










78 

E 

A 

C 

G 








112 

K 

A 

C 

G 

P 







114 

D 

P 










115 

S 

A 

c 

D 

F 

G 

H 

I 

P W Y 



116 

I 

Y 










117 

L 

M 

w 

Y 








118 

A 

P 

T 









118 

V 

A 

C 

G 

P 

T 






120 

R 

P 










124 

Q 

A 

C 

G 

H 

I 

M 

P 

T V W Y 



125 

R 

P 

T 

W 

Y 







126 

I 

M 

Y 









127 

T 

P 

W 

Y 








129 

Y 

M 











WO 02/085941 


PCT/EP02/02218 


4/19 

FIGURE 4 


Preferred substitutions in human INFoQ (WT = wild type residue, MUT = desired residue). 
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FIGURE 5 
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FIGURE 6a 


: : Strong Binding Peptides 
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FIGURE 6b 
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FIGURE 6c 
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FIGURE 9 


IFNoc IMMUNOGENIC REGIONS 


REGION 1 (Rl) 

Residue* 24 57 
ISLFSCLKDRHDFGFPQEEFGNQFQKAETIPVLH 
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R3 Cumulative Stimulation Index from responsive donors > 12 
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FIGURE 12a 
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FIGURE 13 
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FIGURE 15a 


Donor 3 Response to Stimulation with IQpM IFNoc Peptides 
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Donor 15 Response to Stimulation with lOuM IFNcc Peptides 
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FIGURE 15 c 


Donor 19 Response to Stimulation with 10\xM IFNa Peptides 
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FIGURE 15d 


Donor 6 Response to Stimulation with lO^M IFNa Peptides 
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FIGURE 15e 


Donor 16 Response to Stimulation with lOuM IFNa Peptides 
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